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(57) Abstract 

In accordance with the present invention, it has been 
discovered that nuclear receptor proteins isolated from the 
silk moth bombyx mori (bR) are useful for the regulation 
of transgene expression. bR is generally thought to be 
a strong transcriptional regulator within cells of the silk 
moth. In accordance with the present invention, it has 
been discovered that bR is also functional in mammalian 
cells. It has further been discovered that the addition 
of activation domains to the bR open-reading frame 
enhances the activity of the ligand modulated regulator 
to afford high-level transcriptional induction. Further 
modifications to the bR ligand binding domain result in 
receptors with unique tranactivational characteristics. 
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Modified Lepidopteran Recent org and Hybrid 
Multifunctional Proteins for Use in Regulation of 
Transaene Expression 
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RELATED APPLICATIONS 

This application claims priority from Provisional 

Application, U.S. S.N. 60/ , filed July 7 , 1998, and 

also claims priority from United States Serial No. 
10 08/891,298, filed July 10, 1997, now pending, the entire 
contents of both of which are hereby incorporated by 
reference herein. 

FIELD OF THE INVENTION 

The present invention relates to methods in the field 
15 of recombinant DNA technology, and products related 
thereto. In a particular aspect, the invention relates to 
methods for modulating the expression of exogenous genes in 
mammalian systems, and products useful therefor. 

BACKGROUND OF THE INVENTION 

20 In the field of genetic engineering, precise control 

of gene expression is an invaluable tool for studying, 
manipulating and controlling development and other 
physiological processes. For example applications for 
regulated gene expression in mammalian systems include 

25 inducible gene targeting, overexpression of toxic and 
teratogenic genes, anti-sense RNA expression, and gene 
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Nat Genet 8:280-284), spinocerebellar ataxias (Kawaguch et 
al. (1994) Nat Genet 8:221-228, Imbert et al . (1993) Nat 
Genet 4:72-76, Sanpei et al . (1996) Nat Genet 14:277-284, 
and Zhuchenko et al . (1997) Nat Genet 15:62-69), and 
5 spinobulbar muscular atrophy (David et al . (1997) Nat Genet 
17:65-70) . PolyQ expansion has also been linked to the 
formation of intracellular protein aggregates in vivo in 
transgenic mice expressing human huntingtin fragments with 
expanded repeats (Ordway et al (1997) Cell 91:753-763), 
10 brain tissue from Huntington's patients (DiFiglia et al . 
(1997) Science 277:1990-1993) and transiently transfected 
cultured cells (Martindale et al (1998) Nat Genet 18:150- 
154 and Cooper et al . (1998) Hum Mol Genet 7:783-790). 

In transient transfection models, a 79 -Q expanded 
15 repeat from the ataxin 3 gene has been shown to form a 
"punctate pattern" within the cytoplasm, and to result in 
the apopotic cell death of 70-80% of transfected COS-7 
cells within 48 hours of transfection. Paulson et al . has 
reported that 293 cells transfected with a separate 78-Q 

2 0 ataxin- 3 derived construct acquired intranuclear inclusion 

bodies and also underwent apoptotic cell death in a subset 
(30%) of the transfected population with 72 hours of 
transfection. Neuroblastoma cells transiently transfected 
with an 82-Q construct derived from an expanded huntingtin 
25 gene exon 1 were found to be 2-3 times more susceptible to 
staurosporine induced apoptosis than cells transfected with 
a LacZ control construct (18) , however, spontaneous cell 
death was not described. Variations in culture techniques, 
transfection efficiencies, and cell lines, however, make it 

3 0 difficult to rigorously characterize the progression of 

polyQ expression, intracellular aggregate (IA) formation, 
and cell death. One impediment to the comparative study of 
IAs in vitro is that transient transfection studies predict 
that toxicity from polyQ over-expression will prevent the 
35 isolation of stable cultured cells. 
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Accordingly, there is a need in the art for improved 
systems to precisely modulate the expression of exogenous 
genes in mammalian subjects. For example, a non- 
mammalian-based transcription regulating system would be 

5 desirable for general application to transgene regulation 
in in vitro, ex vivo, and in vivo applications, as well as 
transgenic animals. Such systems would be extremely 
desirable to conditionally express pathogenic or toxic 
compounds in order to produce stable cells. A system that 

0 is simple, compact and dependent on ligands which are 
relatively inexpensive, readily available and of low 
toxicity in animals would prove useful for stimulation of 
regulated systems. 



BRIEF DESCRIPTION OF THE INVENTION 



15 in accordance with the present invention, it has been 

discovered that nuclear receptor proteins isolated from the 
silk moth bombyx mori (bR) are useful for the regulation of 
transgene expression. bR is generally thought to be a 
strong transcriptional regulator within cells of the silk 

20 moth. In accordance with the present invention, it has 
been discovered that bR is also functional in mammalian 
cells without the addition of exogenous dimer partners 
therefor. It has further been discovered that the addition 
of activation domains to the bR open-reading frame (VbR) 

25 enhances the activity of the ligand modulated regulator to 
afford high-level transcriptional induction (see, e.g., 
Figure 1A) . Further modifications to the bR ligand binding 
domain result in receptors with unique transactivation 
characteristics (see, e.g., Figure IB). 

3 0 In accordance with another aspect of the present 

invention, hybrid proteins produced by fusion of modified 
bRs with other ligand-regulated proteins have been found to 
be capable of high level, regulated transactivation of 
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sequences controlled by both hormone response elements and 
tetracycline operators. VbR variants and hybrid proteins 
(see, e.g., Figure 1C) , in combination with the appropriate 
promoters and transgenes, can be introduced into target 
cells by common methods such as transfection of plasmids or 
by virus mediated gene transfer. The small size and 
simplicity of these proteins makes them particularly 
attractive for use in retroviral vectors. 

Invention receptor peptides have proven capable of 
providing a high degree (>100 fold) of ligand-dependent 
transgene expression in a variety of infected and 
transfected cell types including primary cell types. in 
addition, several transgenes including luciferase, LacZ, 
human GH, GFP, tyrosine hydroxylase, and the low-affinity 
15 NGF receptor, have all been shown to function in a 
ligand-dependent fashion in cells infected with invention 
constructs . 



10 



20 



Invention methods provide for regulated gene 
expression by exogenous non-mammalian inducers, and 
therefor can be advantageously employed in a variety of in 
vivo and in vitro mammalian expression systems. For 
example, inducible expression of flp recombinase in 
transgenic mammals, in accordance with invention methods, 
would enable those of skill in the art to accomplish 
25 temporally specific inducible gene targeting of the adult 
or the developing embryo (See, e.g., U.S. Patent Nos . 
5,654,182 and 5,677,177). Stable cell lines conditionally 
expressing desired genes provide a valuable tool for the 
study and treatment of diseases. In order to maximize the 
3 0 chance of obtaining stably transfected/inf ected cultured 
cells expressing a desired transgene, such as polyglutamine 
expression cassette {PQEC) , a retroviral construct is 
provided in which expression of the transgene is controlled 
by an exogenous ligand. 
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For example, in order to study possible mechanisms 
underlying Huntington's disease, cultured cell types that 
conditionally express the glutamine (CAG) repeats 
characteristic of the pathological form of the huntingtin 
5 protein are produced. Two aspects of polyQ-mediated 
cytotoxicity are well-suited to characterization in vitro; 
the expression, distribution, and aggregation of proteins 
with expanded polyQ tracts within individual cells, and the 
progression of polyQ expression to death of the cell. The 

10 huntingtin protein is expressed predominantly in the 
cytoplasm of both neuronal and non-neuronal cell types. 
The expression of polyglutamine tracts separately from the 
huntingtin protein will allow one to determine what toxic 
properties, if any, polyglutamine tracts themselves may 

15 harbor. A variable length polyglutamine fused to cytosolic 
reporter proteins such as green fluorescent protein (GFP) 
facilitates the selection and characterization of PQEC 
ligand-responsive cell populations and clonal lines. The 
use of recombinant retroviruses as PQEC transfer vectors 

2 0 will allow the stable integration of the transgene into a 
wide variety of cultured cell types. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A provides a schematic drawing of the 
nuclear receptor protein isolated from the silk moth 
25 bombyx mori (bR) , as well as a construct containing a 
VP16 activation domain fused at an internal Mlul site 
near the N- terminus of the bombyx mori nuclear receptor, 
referred to as VbR. 



Figure IB depicts the replacement of several amino 
30 acids of the ligand binding domain of the bombyx mori 
nuclear receptor with sequences from the retinoic acid 
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receptor or the thyroid hormone receptor (VbRRA and 
VbRTR, respectively) . 

Figure 1C provides a schematic of a hybrid 
transactivating construct , designated TTMT, containing a 
5 tetracycline-responsive transactivator (Tet -responsive 
region) , a VP16 activation domain and a peptide derived 
from bombyx mori . 



Figure 2A shows the induction of fusion proteins 
(VP16 activation domain fused to either Drosophila 
10 ecdysone receptor (dVEcR) or bombyx mori nuclear receptor 
derived (VbR) ) with 5 /xg/ml tebufenozide in the absence of 
exogenous dimer partners. 

Figure 2B shows that a peptide derived from bombyx 
mori responds to tebufenozide (teb) with greater than 
15 200-fold induction, while Drosophila EcR responds weakly, 
it at all. 



Figure 2C illustrates the action of tebufenozide 
(teb) on the expression of EcRE-Luc reporter when co- 
transfected with the following receptors: 
20 LINX: which encodes the tet- transactivator (TTA) , 

VbR: which encodes VbR, 

TTMT: which encodes the TTA -VbR fusion protein, and 
TTMT-2V: similar to TTMT but with two VP16 t-domains. 

Figure 2D depicts the same experiment illustrated in 
25 Figure 2C, utilizing a TetO-luc reporter, which responds 
only to TTA or fusion proteins in the absence of 
doxycycline . 
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Figure 2E shows the results of an experiment 
employing CV-1 cells transfected with TTMT and two 
reporters: 6TetO-luc and 4-EcRE-P-galactosidase . 

Figure 3 shows that lower levels of teb are as 
5 effective as higher levels for stimulating full activity. 

Figure 4 is a schematic drawing of representative 
single-plasmid constructs involving VbR use. 

10 Figure 5 illustrates the construction of the pBW 

plasmid. 

Figure 6A provides a comparison of Bombyx mori 
ecdysone receptor (BE) and Drosophila melanogaster ecdysone 
receptor (DE) by transient transfection with both receptors 
15 and the dimer partners Usp or RXR in 293 cells. 

Figure 6B provides a similar comparison as in Figure 
6A, however, in CV-1 cells. 

Figure 7A is a schematic drawing of native and 
chimeric ecdysone receptors. 

20 Figure 7B is a graph depicting the transient 

transfection of VEH into CV-1 cells. 

Figure 8 is a schematic drawing comparing several 
different chimeric receptors. 

Figure 9A provides a schematic of receptor variant 
25 viral vectors and an ecdysone response element responsive 
reporter virus (MS) . 
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Figure 9B illustrates induction levels of luciferase 
in 293, CV-1 and primary fibroblast cells infected with MS- 
luc virus. 

Figure 10 illustrates the construction of the Boris I 
5 plasmid. 

Figure 11 illustrates the construction of the Boris 
III plasmid. 

Figure 12 illustrates the construction of the Boris IV 
plasmid . 

10 Figure 13 illustrates the construction of the pBW-6E- 

GFP-CB-VBE plasmid. 

Figure 14 illustrates the retroviral construct, BORIS 
III transcribes a transgene in the presence and absence of 
a ligand. 

15 Figures 15A and B illustrate the construction of a 

PolyQ expression construct. 

Figure 16 provides a comparison of various polyQ-GFP 
vectors constructed. 



DETAILED DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention, there are 

provided nucleic acid constructs encoding a receptor 
peptide comprising : 

(i) a ligand binding domain and hinge region of 

a non-mammalian member of the nuclear 
2 5 receptor superfamily, 
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(ii) a DNA binding domain, and 

(iii) an activation domain, 

wherein the receptor peptide activates a regulatory element 
in the absence of an exogenous dimer partner therefor and 
5 in the presence of a ligand for said ligand binding domain. 
Alternatively, there are provided nucleic acid constructs 
encoding a receptor peptide comprising: 

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 

10 super family, and 

(ii) a DNA binding domain, wherein the DNA 
binding domain is not obtained from the same 
member as the ligand binding domain and 
hinge region, 

15 wherein the receptor peptide activates a regulatory 
element in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 



In accordance with another embodiment of the present 
20 invention, there are provided method (s) employing such 
constructs. For example, there are provided method (s) for 
modulating the transcription of exogenous nucleic acid(s) 
in a host containing: 

(i) a nucleic acid construct comprising a promoter 
25 and said exogenous nucleic acid(s) under the 

control of a regulatory element; and 

(ii) a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear 

3 0 receptor superfamily which is not normally 

present in the cells of said host, wherein said 
receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
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partner therefor and in the presence of a ligand 
for said ligand binding domain, 



said method comprising administering to said host an 
amount of ligand effective to modulate the transcription of 
5 said exogenous nucleic acid(s); wherein said ligand is not 
normally present in the cells of said host. 

As employed herein, the terms "modulate" and 
"modulating" refer to the ability of a given 
ligand/receptor peptide complex to activate/deactivate 

10 and/or up-regulate/down-regulate transcription of exogenous 
nucleic acids, relative to the transactivation activity of 
said receptor peptide in the absence of ligand. The actual 
effect of complex formation on the transactivation activity 
of a receptor peptide will vary depending on the specific 

15 ligand and DNA binding domains employed in the receptor 
peptide and on the regulatory element with which the 
ligand/receptor peptide complex interacts. 

As employed herein, the term "host" refers to the 
cell, tissue, organ or organism in need of transcriptional 

20 regulation of exogenous or endogenous nucleic acids. 
Preferably, hosts are mammalian or mammalian derived cells 
or tissue. Exemplary mammals include: humans; domesticated 
animals, e.g., rat, mouse, rabbit, canine, and feline; farm 
animals, e.g., chicken, bovine, porcine and ovine; and 

25 animals of zoological interest, e.g., monkey and baboon; 
and the like. 



As employed herein, a "dimer partner" refers to 
members of the nuclear receptor superfamily to which other 
members preferentially bind to form heterodimeric species. 
3 0 For example, members of the nuclear receptor superfamily 
preferentially form heterodimers with a common partner, the 
retinoid X (or 9-cis retinoic acid) receptor (RXR, see, for 
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example, Yu et al . (1991) Cell 67:1251-1266; Bugge et al . 
(1992) EMBO J. 11:1409-1418; Kliewer et al . (1992) Nature 
355:446-449; Leid et al , (1992) Cell 68:377-395; Marks et 
al. (1992) EMBO J. 11:1419-1435; Zhang et al . (1992) Nature 
5 355:441-446; Issemann et al . (1993) Biochimie. 75:251-256). 
Additional dimer partners for members of the nuclear 
receptor superf amily include ultraspiracle (Usp) , f arnesoid 
X receptor (FXR) , and the like. 

Receptor peptides utilized in the present invention 
10 are characterized as being fully functional in mammalian 
cells without the addition of any exogenous dimer partners 
therefor. For such receptor peptides, the presence of 
native level or concentration of endogenous dimer partner 
is sufficient to promote transcription. As employed herein, 
15 the phrase "exogenous dimer partner" refers to a dimer 
partner for the receptor peptide that is introduced into 
the host . Exogenous dimer partner may be employed because 
the dimer partner is not present in the host cell (i.e., 
Usp) or because the dimer partner is required at elevated 
20 levels in the host (i.e., higher levels of RXR) . 
Oppositely, the phrase "endogenous dimer partner (s)" refers 
to dimer partners which are native to the unmodified 
mammalian host cell. Invention receptors are advantageous 
because dimer partner is not necessary at elevated levels 

2 5 in order for formation of heterodimers to occur, thereby 

promoting transcription. 

Members of the nuclear receptor superfamily are 
characterized by the presence of five domains: A/B, C, D, 
E and F (Evans, R. Science 240:889-895 (1988)), wherein "C" 

3 0 corresponds to the DNA binding domain, n D" corresponds to 

the hinge region, and "E" corresponds to the ligand binding 
domain. In accordance with a particular aspect of the 
invention, it has been discovered that the hinge region 
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(D) , in coordination with the ligand binding domain (E) , of 
certain non- mammalian members of the nuclear receptor 
superfamily enable such members to interact with dimer 
partners endogenous to mammalian cells, i.e., in the 
5 absence of and without requiring introduction of exogenous 
dimer partners. Specifically, it has been identified that 
regions within the hinge region ( n D") and the ligand 
binding domain ("E") of the bombyx mori receptor function 
in concert to produce high affinity, ligand -dependent 
10 heterodimerization between bR and mammalian dimer partners, 
without requiring the introduction of exogenous dimer 
partners such as Usp. 

Thus, in accordance with a preferred embodiment of the 
present invention, the invention receptor peptide comprises 

15 the hinge region and ligand binding domain derived from 
insect receptors of lepidopteran species such as bombyx 
mori, (Swevers et al . Insect Biochem. Molec. Biol. 
25 (7) :857-866 (1995)), Choristoneura fumiferana (Palli et 
al. Insect Biochem. Molec. Biol. 26 (5) :485-499 (1996)), 

20 Manduca sexta (Fujiwara et al . Insect Biochem. Molec. Biol. 
25 (7) : 845-856 (1995)), Aedes aegypti (Cho et al. Insect 
Biochem Molec. Biol. 25:19-27 (1995), Chorinomus tentans 
(Imhof et al. Insect Biochem. Molec. Biol. 25:115-124 
(1993) , and the like. Presently preferred insect receptors 

2 5 from which the hinge region and/or ligand binding domain is 
derived substantially lack the Oterminal »F n domain. In 
addition, presently preferred insect receptors share less 
than 50% sequence similarity with Drosophila ecdysone 
receptors, comparisons performed by Higgins and Sharp 

30 sequence alignment. 

As employed herein, the term "hinge region" or "D 
domain" refers to the domain located between the DNA 
binding domain and the ligand binding domain of intact 
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members of the nuclear receptor superfamily. In accordance 
with a preferred embodiment of the present invention, the 
hinge region facilitates high affinity interaction with 
dimer partners endogenous to mammalian cells, also referred 
5 herein as "endogenous dimer partners." Preferably, the 
hinge region of a non-mammalian member of the nuclear 
receptor superfamily can be characterized as any sequence 
having substantial sequence identity with amino acid 
residues 273-3 62 set forth in SEQ ID NO: 3, or substantial 
10 portions thereof which confer sufficiently high affinity 
for endogenous dimer partner. 

The hinge region can be functionally located in either 
orientation and at various positions within the receptor 
peptide. For example, the hinge region can be positioned 
15 at either the amino or carboxy terminus of the receptor 
peptide, or therebetween. In a preferred embodiment of the 
present invention, the hinge region is positioned 
internally between the ligand binding and DNA binding 
domains of the receptor peptide (see Figure 1A) . 

20 As employed herein, the phrase "ligand binding domain 

of a non-mammalian member of the nuclear receptor 
superfamily" refers to ligand binding domains derived from 
receptors which are not endogenous to a mammalian host . 
Ligand binding domains which are not endogenous to a 

25 mammalian host include ligand binding domains which are 
modified to be non- responsive to ligands endogenous or 
native to the host. Ligand binding domains contemplated 
for use according to the present invention can be derived 
from non-mammalian member (s) of the nuclear receptor 

3 0 superfamily which members are not normally present in the 
cells of a host. Ligand binding domains are preferably 
derived from the carboxy- terminal portion of non-mammalian 
receptors which are capable of activating transcription of 
regulatory elements in the absence of exogenous dimer 
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partner therefore. Exemplary receptors which are not 
normally present in mammalian cells include insect 
receptors, plant receptors, and the like. 

Ligand binding domains can be functionally located in 
5 either orientation and at various positions within the 
receptor peptide. For example, the ligand binding domain 
can be positioned at either the amino or carboxy terminus 
of the receptor peptide, or therebetween. In a preferred 
embodiment of the present invention, the ligand binding 
10 domain is positioned at the carboxy terminus of the 
receptor peptide (see Figure 1A) . 

Exemplary ligand binding domains can alternatively be 
characterized as comprising substantial sequence identity 
with amino acid residues 393-508 ("E 2 "), preferably amino 

15 acid residues 393-586 (E 2 and E 3 , wherein "E 3 " is amino acid 
residues 509-586), as set forth in SEQ ID NO : 3 , or 
substantial portions thereof (i.e., typically at least 46 
or more contiguous nucleotides thereof) . It has been 
determined that the specific ligand response determinant of 

20 bombyx lies within the E 2 region of the native bR, whereas 
both E 2 and E 3 regions facilitate heterodimer formation with 
endogenous dimer partners. Modifications of this sequence 
contemplated for use in the practice of the present 
invention include replacing several amino acids of the 

25 ligand binding domain with sequences from ligand binding 
domains of other members of the nuclear receptor 
superfamily, such as retinoic acid receptor and/or thyroid 
hormone receptor (Figure IB) . These modifications provide 
unique transactivating characteristics and/or eliminate 

30 restriction sites, which facilitate the construction of 
useful peptide-based viral constructs. 

DNA-binding domains contemplated for use in the 
preparation of invention receptor peptides are typically 
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obtained from DNA-binding proteins (e.g., transcription 
factors) . The term "DNA-binding domain" is understood in 
the art to refer to an amino acid sequence that is able to 
bind to DNA. As used herein, the term "DNA-binding domain" 
5 encompasses a minimal peptide sequence of a DNA-binding 
protein up to the entire length of a DNA-binding protein, 
so long as the DNA-binding domain functions to associate 
with a particular regulatory element. 

DNA-binding domains are known to function 
10 heterologously in combination with other functional domains 
by maintaining the ability to bind DNA recognition sequence 
(see, e.g., Brent and Ptashne, (1985) Cell, 43:729-736). 
For example, with respect to hormone receptors, DNA-binding 
domains are interchangeable, thereby providing numerous 
15 chimeric receptor proteins (see, e.g., U.S. Patent 
4,981,784; and Evans, R. , (1988) Science, 240:889-895). 
Similar to the ligand binding domain, the DNA-binding 
domain can be positioned at either the carboxy terminus or 
the amino terminus, or the DNA-binding domain can be 

2 0 positioned between the ligand binding domain and the 

activation domain. In preferred embodiments of the present 
invention, the DNA-binding domain is positioned internally 
between the ligand binding domain and the activation 
domain. 

25 "DNA-binding protein (s) " contemplated for use herein 

belong to the well-known class of proteins that are able to 
directly bind DNA and facilitate initiation or repression 
of transcription. Exemplary DNA-binding proteins 

contemplated for use herein include transcription control 

3 0 proteins (e.g., transcription factors and the like; Conaway 

and Conaway, 1994, "Transcription Mechanisms and 
Regulation", Raven Press Series on Molecular and Cellular 
Biology, Vol. 3, Raven Press, Ltd., New York, NY). 
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Transcription factors contemplated for use herein as 
a source of such DNA binding domains include, e.g., 
homeobox proteins, zinc finger proteins, hormone receptors, 
helix-turn-helix proteins , helix-loop-helix proteins, 
5 basic-Zip proteins (bZip) , p-ribbon factors, and the like. 
See, for example, Harrison, S., "A Structural Taxonomy of 
DNA-binding Domains," Nature, 353:715-719. Homeobox DNA- 
binding proteins suitable for use herein include, for 
example, HOX, STF-1 (Leonard et al . , 1993, Mol. Endo., 

10 7:1275-1283), Antp, Mat a-2, INV, and the like. See, also, 
Scott et al. (1989), Biochem. Biophys. Acta, 989:25-48. It 
has been found that a fragment of 76 amino acids 
(corresponding to amino acids 140-215 described in Leonard 
et al., (1993) Mol . Endo., 7:1275-1283) containing the STF- 

15 1 homeodomain binds DNA as tightly as wild- type STF-1. 
Suitable zinc finger DNA-binding proteins for use herein 
include Zif268, GLI, XFin, and the like. See also, Klug 
and Rhodes (1987) Trends Biochem. Sci . , 12:464; Jacobs and 
Michaels (1990) New Biol . , 2:583; and Jacobs (1992), EMBO 

20 J. , 11:4507-4517. 

An additional DNA binding domain contemplated for use 
in the practice of the present invention is the GAL4 DNA 
binding domain. The DNA binding domain of the yeast GAL4 
protein comprises at least the first 74 amino terminal 
2 5 amino acids thereof (see, for example, Keegan et al . , 
Science 231:699-704 (1986)). Preferably, the first 90 or 
more amino terminal amino acids of the GAL 4 protein will be 
used, with the 147 amino terminal amino acid residues of 
yeast GAL4 being presently most preferred. 

30 Preferably, DNA-binding domain (s) used herein is (are) 

obtained from a member of the nuclear receptor superfamily. 
As used herein, the phrase "member (s) of the nuclear 
receptor superfamily" (also known as "intracellular 
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receptors" or "steroid/thyroid hormone superfamily of 
receptors") refers to hormone binding proteins that operate 
as ligand-dependent transcription factors, including 
identified members of the nuclear receptor superfamily for 
5 which specific ligands have not yet been identified 
(referred to hereinafter as "orphan receptors"). 

Exemplary members of the steroid/thyroid hormone 
superfamily of receptors (including the various isoforms 
thereof) include steroid receptors such as glucocorticoid 

10 receptor (GR) , mineralocorticoid receptor (MR) , estrogen 
receptor (ER) , progesterone receptor (PR) , androgen 
receptor (AR) , vitamin D 3 receptor (VDR) , various isoforms 
of peroxisome prolif erator-activated receptors (PPARs) , and 
the like; plus retinoid receptors, such as the various 

15 isoforms of retinoic acid receptor (e.g., RARa, RARp, or 
RARy) / the various isoforms of retinoid X receptor (e.g., 
RXRa, RXR3, or RXRy) # and the like (see, e.g., U.S. Patent 
Nos. 4,981,784; 5,171,671; and 5,071,773); thyroid 
receptors (TR) , such as TRa, TR(3, and the like; insect 

20 derived receptors such as the ecdysone receptor, and the 
like; as well as other gene products which, by their 
structure and properties, are considered to be members of 
the superfamily, as defined hereinabove, including the 
various isoforms thereof. Examples of orphan receptors 

25 contemplated for use herein as a source of DNA binding 
domain include HNF4 (see, for example, Sladek et al . , in 
Genes & Development 4: 2353-2365 (1990)), the COUP family 
of receptors (see, for example, Miyajima et al . , in Nucleic 
Acids Research 16: 11057-11074 (1988), and Wang et al . , in 

30 Nature 340: 163-166 (1989)), COUP-like receptors and COUP 
homologs, such as those described by Mlodzik et al . , in 
Cell 60: 211-224 (1990) and Ladias et al . , in Science 251: 
561-565 (1991) , the insect derived knirps and knirps- 
related receptors, and the like. 
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The DNA-binding domains of all members of the nuclear 
receptor superfamily are related. Such domains consist of 
66-68 amino acid residues, and possess about 2 0 invariant 
amino acid residues, including nine cysteines. Members of 
5 the superfamily are characterized as proteins which contain 
these 20 invariant amino acid residues. The highly 
conserved amino acids of the DNA-binding domain of members 
of the superfamily are as follows: 



Cys - X - X - Cys - X - X - Asp* - X - Ala* - X - 
10 Gly* - X - Tyr* - X - X - X - X - Cys - X - X - 

Cys - Lys* - X - Phe - Phe - X - Arg* - X - X - 
X - X - X - X - X - X - X - (X - X -) Cys - X -X 

- X - X - X - (X - X - X -) Cys - X -X - X - Lys 

- X - X - Arg - X - X - Cys - X - X - Cys - Arg* 
15 - X - X - Lys* - Cys - X - X - X - Gly* - Met 

(SEQ ID NO:l) ; 



wherein X designates non-conserved amino acids within the 
DNA-binding domain; an asterisk denotes the amino acid 
residues which are almost universally conserved, but for 
20 which variations have been found in some identified hormone 
receptors; and the residues enclosed in parenthesis are 
optional residues (thus, the DNA-binding domain is a 
minimum of 66 amino acids in length, but can contain 
several additional residues) . 



25 It has also been found that in vitro evolution methods 

can be applied to modify and improve existing DNA-binding 
domains (see, e.g., Devlin et al . , 1990, Science , 249:404- 
406; and Scott and Smith, 1990, Science , 249:386-390) . 
Alternatively, DNA-binding domains which are engineered 

30 with novel DNA- recognition specificity (see, e.g., 
Pomerantz et al . Science 267:93-96, 1995, ZFHD1 , an 
engineered transcription factor with a composite DNA- 
binding domain) are also contemplated. 
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Two polynucleotides or polypeptides are said to be 
"identical" if the sequence of nucleotides or amino acid 
residues in the two sequences is the same when aligned for 
maximum correspondence. Optimal alignment of sequences for 
5 comparison may be conducted by the local homology algorithm 
of Smith and Waterman, Adv. Appl . Math., 2:482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch, 
J. Mol. Biol., 48:443 (1970), by the search for similarity 
method of Pearson and Lipman, Proc. Natl. Acad. Sci. 
10 (U.S.A.) , 85:2444 (1988), by computerized implementations 
of these algorithms (GAP, BESTFIT, FASTA, and T FAST A in the 
Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, Wis.), or by inspection. 
These references are incorporated herein by reference. 

15 The percentage of sequence identity between two 

sequences is determined by comparing two optimally aligned 
sequences over a window of comparison of at least 2 0 
positions. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid 

2 0 base or amino acid residue occurs in both sequences to 
yield the number of matched positions, dividing the number 
of matched positions by the total number of positions in 
the window of comparison (i.e., the window size), and 
multiplying the result by 100 to yield the percentage of 

2 5 sequence ident ity . 

For instance, a preferred method for comparing 
sequences uses the GAP program based on the algorithm of 
Needleman, et at., supra. Typically, the default values for 
all parameters are selected. These are gap weight: 5.0, 

3 0 length weight: 0.30, average match: 1.0, and average 

mismatch: 0.0. 
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As used herein, the phrase "substantial sequence 
identity" refers to nucleotide or amino acid sequences 
which share at least 80% sequence identity, preferably 90%, 
more preferably 95 % or more, regardless of the algorithm 
5 used to determine sequence identity, compared to a 
reference sequence over a comparison window of about 2 0 bp 
to about 2000 bp, typically about 50 to about 1500 bp, 
usually about 350 bp to about 1200. The values of percent 
identity are preferably determined using the GAP program, 

10 referred to above. Another indication that nucleotide 
sequences are substantially identical is if two molecules 
hybridize to each other under stringent conditions. It is 
recognized, however, that proteins (and DNA or mRNA 
encoding such proteins) containing less than the above- 

15 described level of homology produced as splice variants or 
as a result of conservative amino acid substitutions (or 
substitution of degenerate codons) are contemplated to be 
within the scope of the present invention. 

The phrase "substantially the same," used in reference 

2 0 to a DNA nucleotide sequence, an RNA ribonucleotide 

sequence, or an amino acid sequence, refers to sequences 
that have slight and non- consequential sequence variations 
from the actual sequences disclosed herein. Species that 
are substantially the same are considered to be equivalent 
25 to the disclosed sequences. In this regard, "slight and 
non-consequential sequence variations" mean that sequences 
that are substantially the same as the DNA, RNA, or 
proteins disclosed and claimed herein are functionally 
equivalent to the sequences disclosed and claimed herein. 

3 0 Functionally equivalent sequences will function in 

substantially the same manner to produce substantially the 
same effect as the sequences disclosed. 

Receptor peptides employed in the present invention 
can be modified by the introduction of activation domains. 
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Activation domains contemplated for use in the practice of 
the present invention are well known in the art and can 
readily be identified by those of skill in the art. 
Activation domains contemplated for use herein are 
5 typically derived from transcription factors and comprise 
a contiguous sequence that functions to activate gene 
expression when associated with a suitable DNA-binding 
domain and a suitable ligand binding domain. An activation 
domain can be positioned at any convenient site within the 
10 receptor peptide, i.e., at the carboxy terminus, the amino 
terminus or between the ligand binding domain and the DNA 
binding domain. In preferred embodiments of the present 
invention, the activation domain is positioned at the amino 
terminus of the receptor peptide. 

15 Suitable activation domains can be obtained from a 

variety of sources, e.g., from the N-terminal region of a 
member of the steroid/thyroid hormone superfamily of 
receptors, from a transcription factor activation domain, 
such as, for example, VP16, GAL4 , NF-kB or BP64 activation 

20 domains, and the like. The presently most preferred 
activation domain contemplated for use in the practice of 
the present invention is obtained from the N-terminal 
region of the VP16 protein. 

In a particular embodiment of the present invention, 
25 the invention nucleic acid construct further comprises 
promoters and regulatory elements operatively associated 
with exogenous nucleic acids. In a preferred embodiment of 
the present invention, receptor peptide, in the presence of 
a ligand therefor, binds the regulatory element and 
3 0 activates transcription of the exogenous nucleic acids. 

Regulatory elements contemplated for use in the 
practice of the present invention include elements 
responsive to the invention receptor peptide. In a 
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preferred embodiment of the present invention, such 
elements are exogenous regulatory elements not normally 
present in the cells of the host. One class of exogenous 
regulatory elements contemplated for use herein includes 
5 hormone response elements which modulate transcription of 
exogenous nucleic acid when bound to the DNA binding domain 
of an invention receptor peptide. 

Additional regulatory elements that may be utilized in 
the practice of the present invention include exogenous 

10 regulatory elements responsive to a non-mammalian 
transactivator . One such transactivator-responsive 

regulatory element is an operator which confers 
responsiveness to antibiotics. Exemplary operators 

contemplated for use in this aspect of the invention 

15 include the tetracycline-analog regulated operator, the TET 
operator, the Lac operator, and the like. 

Exogenous response elements contemplated for use 
herein are short cis-acting sequences (i.e., having about 
12-20 bp) that are required for activation of transcription 

20 in response to association with a suitable ligand, such as 
diacyl hydrazines, and an invention receptor peptide. 
Response element sequences contemplated for use herein 
function in a position- and orientation- independent 
fashion. Exemplary response elements include hormone 

25 response elements, GAL4 response elements and the like. 

Hormone response elements contemplated for use in the 
practice of the present invention are response elements 
which are responsive to members of the nuclear receptor 
superfamily. These response elements comprise at least two 
30 half-sites (in either direct repeat or inverted repeat 
orientation to one another) , separated by a spacer of 0-5 
nucleotides. As used herein, the term "half -site" refers 
to a contiguous 6 nucleotide sequence that is bound by a 
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particular member of the nuclear receptor superf amily . 
Typically, two half -sites with a corresponding spacer make 
up a hormone response element . Hormone response elements 
can be incorporated in multiple copies into various 
5 transcription regulatory regions. 

Preferred hormone response elements employed in the 
practice of the present invention comprise a first half- 
site and a second half -site, separated by a spacer of 0-5 
nucleotides; 

10 wherein each half -site has the sequence: 

-RGBNNM- , 
(or complements thereof) wherein 

each R is independently selected from A or G; 
each B is independently selected from G, C, or T; 
15 each N is independently selected from A, T, C, or G; 

and 

each M is independently selected from A or C; 
with the proviso that at least 4 nucleotides of each 
-RGBNNM- group of nucleotides are identical with the 
2 0 nucleotides at comparable positions of the sequence 
-AGGTCA- . 

Exemplary half- sites having the -RGBNNM- motif for use 
in preparing response elements useful in the practice of 
the present invention include, for example, half -sites 
25 selected from -AGGGCA- , -AGTTCA- , -AGGTAA- , -AGGTCA- , 
-GGTTCA-, -GGGTTA-, -GGGTGA- , -AGGTGA- , -GGGTCA- , and the 
like. A particularly preferred first half -site is 
-AGTGCA- . 

Additional response elements included the GAL4 
30 response element. Exemplary GAL 4 response elements are 
those containing the palindromic 17-mer: 
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5 ' - CGGAGGACTGTCCTCCG- 3 ' ( SEQ ID NO : 4 ) , 

such as, for example, 17MX, as described by Webster et al . , 
in Cell 52:169-178 (1988), as well as derivatives thereof. 
Additional examples of suitable response elements include 
5 those described by Hollenberg and Evans in Cell 55:899-906 
(1988); or Webster et al . in Cell 54:199-207 (1988). 

Regulatory elements employed in the practice of the 
present invention are operably linked to a suitable 
promoter for transcription of exogenous nucleic acid(s) 
10 product (s). When exogenous nucleic acid(s), operatively 
linked to a suitable promoter, is (are) introduced into the 
cells of a suitable host, expression of the exogenous 
nucleic acid(s) is (are) controlled by the presence of 
ligands, which are not normally present in the host cells. 

15 As used herein, when referring to nucleic acids, the 

phrase "exogenous to said mammalian host" or simply 
"exogenous" refers to nucleic acids not naturally found at 
levels sufficient to provide a function in the particular 
cell where transcription is desired. For example, 

2 0 exogenous nucleic acids can be either natural or synthetic 
nucleic acids, which are introduced into the host in the 
form of DNA or RNA. The nucleic acids of interest can be 
introduced into target cells (for in vitro applications) , 
or the nucleic acids of interest can be introduced directly 

2 5 or indirectly into a host by the transfer of transformed 

cells into a host. 

In contrast to exogenous nucleic acids, the phrase 
"endogenous nucleic acids" or "endogenous genes" refers to 
nucleic acids naturally found at levels sufficient to 

3 0 provide a function in the particular cell where 

transcription is desired. 
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Exogenous nucleic acids contemplated for use in the 
practice of the present invention include wild type and/or 
therapeutic nucleic acids. 

"Wild type" genes are those that are native to cells 
5 of a particular type. Exemplary wild type nucleic acids 
are genes which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in a host; or 

a substantial excess of which leads to the 
10 occurrence of a non-normal state in a host. 

Such genes may not be expressed in biologically significant 
levels or may be undesirably overexpressed, respectively. 
Thus, for example, while a synthetic or natural gene coding 
for human insulin would be exogenous genetic material to a 
15 yeast cell (since yeast cells do not naturally contain 
insulin genes) , a human insulin gene inserted into a human 
skin fibroblast cell would be a wild type gene with respect 
to the fibroblast since human skin fibroblasts contain 
genetic material encoding human insulin, although human 

2 0 skin fibroblasts do not express human insulin in 

biologically significant levels. 

Therapeutic nucleic acids contemplated for use in the 
practice of the present invention include those which: 

encode products which are toxic to the cells in 
25 which they are expressed; or 

encode products which impart a beneficial 
property to a host; or 
those which transcribe nucleic acids which modulate 
transcription and/or translation of endogenous genes, 

3 0 As employed herein, the phrase "therapeutic nucleic 

acids" refers to nucleic acids which impart a beneficial 
function to the host in which such nucleic acids are 
transcribed. Therapeutic nucleic acids are those that are 
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not naturally found in host cells. For example, synthetic 
or natural nucleic acids coding for wild type human insulin 
would be therapeutic when inserted into a skin fibroblast 
cell so as to be expressed in a human host, where the human 
5 host is not otherwise capable of expressing functionally 
active human insulin in biologically significant levels. 
Further examples of therapeutic nucleic acids include 
nucleic acids which transcribe antisense constructs used to 
suppress the expression of endogenous genes. Such 

10 antisense transcripts bind endogenous nucleic acid (mRNA or 
DNA) and effectively cancel out the expression of the gene. 
In accordance with the methods described herein, 
therapeutic nucleic acids are expressed at a level that 
provides a therapeutically effective amount of the 

15 corresponding therapeutic protein. 

Exogenous nucleic acids useful in the practice of the 
present invention include genes that encode biologically 
active proteins of interest, such as, e.g., secretory 
proteins that can be released from said cell; enzymes that 
20 can metabolize a toxic substance to produce a non-toxic 
substance, or that metabolize an inactive substance to 
produce a useful substance; regulatory proteins; cell 
surface receptors; and the like. Useful genes include 
genes that encode blood clotting factors such as human 

2 5 factors VIII and IX; genes that encode hormones such as 

insulin, parathyroid hormone, luteinizing hormone releasing 
factor (LHRH) , alpha and beta seminal inhibins, and human 
growth hormone; genes that encode proteins such as enzymes, 
the absence of which leads to the occurrence of an abnormal 

3 0 state; genes encoding cytokines or lymphokines such as 

interferons, granulocytic macrophage colony stimulating 
factor (GM-CSF) , colony stimulating factor-1 (CSF-1) , tumor 
necrosis factor (TNF) , and erythropoietin (EPO) ; genes 
encoding inhibitor substances such as alpha^antitrypsin; 
35 genes or nucleotides which characterize specific conditions 
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or diseases (i.e., CAG repeat expansion and concomitant 
polyglutamine (polyQ) expression has been linked to a 
variety of neurodegenerative conditions including 
Huntington's disease, dentatorubro-pallidoluysian atrophy, 
5 spinocerebellar ataxias, and spiriobulbar muscular atrophy, 
and the like) ; genes encoding substances that function as 
drugs, e.g., genes encoding the diphtheria and cholera 
toxins; and the like. 

Additional nucleic acids contemplated for use in 
10 accordance with the present invention include genes which 
encode proteins present in dopaminergic neurons (useful, 
for example, for the treatment of Parkinson ! s disease), 
cholinergic neurons (useful, for example, for the treatment 
of Alzheimer's disease), hippocampal pyramidal neurons 
15 (also useful for the treatment of Alzheimer's disease), 
norepinephrine neurons (useful, for example, for the 
treatment of epilepsy) , spinal neurons (useful, for 
example, for the treatment of spinal injury) , glutamatergic 
neurons (useful, for example, for the treatment of 
20 schizophrenia), cortical neurons (useful, for example, for 
the treatment of stroke and brain injury) , motor and 
sensory neurons (useful, for example, for the treatment of 
amyotrophic lateral sclerosis), and the like. 

Typically, nucleic acid sequence information for 
25 proteins encoded by exogenous nucleic acid(s) contemplated 
for use employed herein can be located in one of many 
public access databases, e.g., GENBANK, EMBL, Swiss-Prot, 
and PIR, or in related journal publications. Thus, those 
of skill in the art have access to sequence information for 
3 0 virtually all known genes. Those of skill in the art can 
either obtain the corresponding nucleic acid molecule 
directly from a public depository or the institution that 
published the sequence. Optionally, once the nucleic acid 
sequence encoding a desired protein has been ascertained, 
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the skilled artisan can employ routine methods, e.g., 
polymerase chain reaction (PCR) amplification, to isolate 
the desired nucleic acid molecule from the appropriate 
nucleic acid library. Thus, all known nucleic acids 
5 encoding proteins of interest are available for use in the 
methods and products described herein. 

Additional components which can optionally be 
incorporated into invention constructs include selectable 
markers. Selectable markers contemplated for use in the 

10 practice of the present invention include radiolabeled 
molecules, fluorescent molecules, ligands, enzymes, and the 
like. Preferable selectable markers are enzymes such as 
antibiotic resistance genes, genes which enable cells to 
process metabolic intermediaries, and the like. Exemplary 

15 antibiotic resistance genes include genes which impart 
tetracycline resistance, genes which impart ampicillin 
resistance, zeomycin resistance, neomycin resistance, 
hygromycin resistance, puromycin resistance, and the like. 
Genes which enable cells to process metabolic 

20 intermediaries include genes which permit cells to 
incorporate L-histidinol , genes encoding thymidine kinase, 
genes encoding xanthine -guanine phosphoribosyl transferase 
(gpt) , genes encoding dihydrof olate reductase, genes 
encoding asparagine synthetase, and the like. 

2 5 In accordance with a preferred embodiment of the 

present invention, the invention nucleic acid construct 
further comprises, in addition to the receptor peptide, a 
non-mammalian transact ivator, not a member of the nuclear 
receptor superfamily and not normally present in the cells 

30 of said host, and a compatible transactivator responsive 
regulatory element not normally present in cells of said 
host. The transactivator responsive regulatory element 
controls transcription of the exogenous nucleic acid(s) or 
a second nucleic acid construct comprising a second 
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exogenous nucleic acid(s) . Examples of transactivator 
responsive regulatory elements include operators which are 
responsive to non- mammalian transactivators which confer 
responsiveness to antibiotics. Exemplary operators 

5 contemplated for use in this aspect of the invention 
include the tetracycline-analog regulated operator, the TET 
operator, the Lac operator, and the like. Preferably, the 
transactivator responsive regulatory elements employed in 
the practice of the present invention are operably linked 
10 to a suitable promoter for transcription of exogenous 
nucleic acid(s) proteins. 

Non-mammalian transactivators, other than members of 
the nuclear receptor superfamily, contemplated for use in 
the practice of the present invention function in the 

15 absence of exogenous dimer partner. Examples of 

transactivators that typically function in the absence of 
exogenous dimer partner are tetracycline-controlled 
transactivators, Vpl6-Lac fusion transactivators, and the 
like. When contained as part of a transactivating 

20 construct, the transactivator can be positioned at any 
convenient site within the construct, i.e., at the carboxy 
terminus or the amino terminus of the transactivating 
construct. In preferred embodiments of present invention, 
the transactivator is positioned at the amino terminus of 

25 the transactivating construct (Figure 1C) . 

Preferably, the non-mammalian transactivator confers 
responsiveness to antibiotics. An example of a non- 
mammalian transactivator which confers responsiveness to 
antibiotics, as contemplated for use in the practice of the 
30 present invention is the tetracycline-controlled 
transactivator. The tetracycline inducible system is well- 
known in the art (see, e.g, Gossen et al . (1992) PNAS 89, 
5547-5551; Gossen et al . (1993) TIBS 18, 471-475; Furth et 
al. (1994) PNAS 91, 9302-9306; Shockett et al . (1995) PNAS 
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92, 6522-6526; and Hoshimaru et al. (1996) PNAS 93 (4) : 1518- 
1523) . Other examples of non- mammalian transactivators 
well known in the art include the IPTG inducible system 
based on a VP16-Lac repressor fusion which functions 
5 through lac operator sequences inserted into heterologous 
promoters (see, e.g., Bairn et al . (1991) PNAS 88:5072- 
5076) . 

In addition to the receptors/transactivators set forth 
above, those of skill in the art recognize that other 

10 transactivators can be used herein, e.g., homeobox 
proteins, zinc finger proteins, hormone receptors, helix- 
turn-helix proteins, helix-loop-helix proteins, basic-Zip 
proteins (bZip) , (3-ribbon factors, and the like. See, for 
example, Harrison, S., H A Structural Taxonomy of DNA- 

15 binding Domains," Mature, 353:715-719. Homeobox DNA- 
binding proteins suitable for use herein include, for 
example, HOX, STF-1 (Leonard et al . , (1993) Mol . Endo., 
7:1275-1283), Antp, Mat oe-2, INV, and the like. See, also, 
Scott et al. (1989) Biochem. Biophys. Acta, 989:25-48. It 

2 0 has been found that a fragment of 76 amino acids 

(corresponding to amino acids 140-215 described in Leonard 
et al., (1993) Mol. Endo., 7:1275-1283) containing the STF- 
1 homeodomain binds DNA as tightly as wild-type STF-1. 
Suitable zinc finger DNA-binding proteins for use herein 
25 include Zif268, GLI , XFin, and the like. See also, Klug 
and Rhodes (1987) Trends Biochem. Sci., 12:464; Jacobs and 
Michaels (1990) New Biol., 2:583; and Jacobs (1992) EMBO 
J. , 11:4507-4517. 

In a preferred embodiment of the present invention, 

3 0 expression cassettes are prepared by operably linking 

invention nucleic acid constructs to a suitable promoter 
for expression of the encoded receptor peptide. As used 
herein, the term "promoter" refers to a specific nucleotide 
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initiates RNA synthesis. The promoter sequence is the site 
at which transcription can be specifically initiated under 
proper conditions. When nucleic acid constructs, 

5 operatively linked to a suitable promoter, is introduced 
into the cells of a suitable host, expression of the 
nucleic acid construct is conditionally or perpetually 
initiated . 

Promoters contemplated for use in the practice of the 
10 present invention include inducible (e.g., minimal CMV 
promoter, minimal TK promoter, modified MMLV LTR) , 
constitutive (e.g., chicken b-actin promoter, MMLV LTR 
(non-modified), DHFR) , and/or tissue specific promoters. 

Inducible promoters contemplated for use in the 
15 practice of the present invention comprise transcription 
regulatory regions that function maximally to promote 
transcription of mRNA under inducing conditions. Examples 
of suitable inducible promoters include DNA sequences 
corresponding to: the E. coli lac operator responsive to 
20 IPTG (see Nakamura et al., Cell, 18:1109-1117, 1979); the 
metal lothione in promoter metal -regulatory- elements 
responsive to heavy-metal (e.g., zinc) induction (see Evans 
et al., U.S. Patent No. 4,870,009), the phage T71ac 
promoter responsive to IPTG (see Studier et al . , Meth. 
25 Enzymol., 185: 60-89, 1990; and U.S. #4,952,496), the heat - 
shock promoter; the TK minimal promoter; the CMV minimal 
promoter; a synthetic promoter; and the like. 

Exemplary constitutive promoters contemplated for use 
in the practice of the present invention include the CMV 
30 promoter, the SV40 promoter, the DHFR promoter, the mouse 
mammary tumor virus (MMTV) steroid- inducible promoter, 
Moloney murine leukemia virus (MMLV) promoter, elongation 
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factor la (EFla) promoter, albumin promoter, APO Al 
promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CD 3 promoter, immunoglobulin light or 
heavy chain promoters, neurof iliment promoter, neuron 
5 specific enolase promoter, L7 promoter, CD2 promoter, 
myosin light chain kinase promoter, HOX gene promoter, 
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD 
promoter, MYF5 promoter, phophoglycerokinase (PGK) 
promoter, Stfl promoter, Low Density Lipoprotein (LDL) 
10 promoter, chicken b-actin promoter (used in conjunction 
with ecdysone response element) and the like. 

As readily understood by those of skill in the art, 
the term "tissue specific" refers to the substantially 
exclusive initiation of transcription in the tissue from 
15 which a particular promoter, which drives expression of a 
given gene, is derived (e.g., expressed only in T-cells, 
endothelial cells, smooth muscle cells, and the like) . 
Exemplary tissue specific promoters contemplated for use in 
the practice of the present invention include the GH 

2 0 promoter, the NSE promoter, the GFAP promoter, 

neurotransmitter promoters (e.g., tyrosine hydroxylase, TH, 
choline acetyltransf erase , ChAT, and the like), promoters 
for neurotropic factors (e.g., a nerve growth factor 
promoter, NT- 3, BDNF promoters, and the like) , and so on. 

25 In yet another embodiment of the present invention, 

there are provided constructs comprising a promoter, a 
tetracycline-controlled transact ivator, a VP16 activation 
domain, a DNA binding domain and bombyx mori -derived hinge 
region and ligand binding domain encoding sequence, wherein 

3 0 the components of the construct are operatively associated 

with the other components of the construct. 

As used herein, the phrase "operatively associated 
with" refers to the functional relationship of DNA with 
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regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational 
stop sites, and other signal sequences. For example, 
operative linkage of DNA to a promoter refers to the 
5 physical and functional relationship between the DNA and 
promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that 
specifically recognizes, binds to and transcribes the DNA. 

In yet another embodiment of the present invention, 
10 there are provided constructs comprising nucleic acid(s) 
encoding a a VP16 activation domain operatively associated 
with nucleic acid encoding a DNA binding domain and hombyx 
mori- derived hinge region and ligand binding domain. 

In accordance with another embodiment of the present 
15 invention, there are provided gene transfer vectors useful 
for the introduction of invention constructs into suitable 
host cells. Such gene transfer vectors preferably comprise 
a first reporter under the control of a regulatory element, 
a second reporter under the control of an operator which is 

2 0 responsive to a ligand-mediated receptor which confers 

responsiveness to antibiotics, and a construct comprising 
a promoter and nucleic acid encoding a VP16 activation 
domain, a DNA binding domain and bombyx mori- derived hinge 
region and ligand binding domain, optionally, a 
25 tetracycline-controlled transactivator . The number of 
copies of regulatory elements can readily be varied by 
those of skill in the art. For example, transcription 
regulatory regions can contain from 1 up to about 50 copies 
of a particular regulatory element, preferably 2 up to 

3 0 about 25 copies, more preferably 3 up to about 10-15 

copies, with about 4-6 copies being especially preferred. 

Gene transfer vectors (also referred to as "expression 
vectors") contemplated for use herein are recombinant 
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nucleic acid molecules that are used to transport invention 
nucleic acid constructs into host cells for expression 
and/or replication thereof. Expression vectors may be 
either circular or linear, and are capable of incorporating 
5 a variety of nucleic acid constructs therein. Expression 
vectors typically come in the form of a plasmid that, upon 
introduction into an appropriate host cell, results in 
expression of the inserted nucleic acid. 

Suitable expression vectors for use herein include a 
10 recombinant DNA or RNA construct (s) , such as plasmids, 
phage, recombinant virus or other vectors that, upon 
introduction into an appropriate host cell, result (s) in 
expression of the inserted DNA. Appropriate expression 
vectors are well known to those of skill in the art and 
15 include those that are replicable in eukaryotic cells 
and/or prokaryotic cells and those that remain episomal or 
those which integrate into the host cell genome. 

As used herein, the phrase "transcription regulatory 
region" refers to that portion of a nucleic acid or gene 
20 construct that controls the initiation of mRNA 
transcription. Regulatory regions contemplated for use 
herein, in the absence of the non-mammalian transactivator , 
typically comprise at least a minimal promoter in 
combination with a regulatory element responsive to the 
25 ligand/receptor peptide complex. A minimal promoter, when 
combined with a regulatory element functions to initiate 
mRNA transcription in response to a ligand/receptor peptide 
complex. However, transcription typicallywill not occur 
unless the required inducer (ligand therefor) is present. 
30 Conversely, when the non-mammalian transactivator, other 
than members of the nuclear receptor superfamily, is 
present in the host, the transactivator-responsive 
regulatory element will typically only induce transcription 
in the absence of ligand therefor. 
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Preferably, the transcription regulatory region 
further comprises a binding site for ubiquitous 
transcription factor (s) . Such binding sites are preferably 
positioned between the promoter and the regulatory element. 
5 Suitable ubiquitous transcription factors for use herein 
are well-known in the art and include, for example, Spl . 

Expression vectors suitable for use in the practice of 
the present invention are well known to those of skill in 
the art and include those that are replicable in eukaryotic 
10 cells and/or prokaryotic cells as well as those that remain 
episomal and those that integrate into the host cell 
genome. Expression vectors typically further contain other 
functionally important nucleic acid sequences encoding 
antibiotic resistance proteins, and the like. 

15 Exemplary eukaryotic expression vectors include 

eukaryotic constructs, such as the pSV-2 gpt system 
(Mulligan et al . , (1979) Nature, 277:108-114); pBlueSkript 
(Stratagene, La Jolla, CA) , the expression cloning vector 
described by Genetics Institute {Science, (1985) 228:810- 

20 815), and the like. Each of these plasmid vectors are 
capable of promoting expression of the receptor peptide of 
interest . 

In a specific embodiment, a gene transfer vector 
contemplated for use herein is a viral vector, such as 

25 Adenovirus, adeno-associated virus, or herpes-simplex virus 
based vectors, and synthetic vectors for gene therapy, and 
the like (see, e.g., Suhr et al . (1993) Arch, of Neurol, 
50:1252-1268). Adenovirus and adeno-associated virus are 
extremely suitable as gene transfer vectors to produce 

30 receptor peptides. Alternatively, a gene transfer vector 
employed herein is a retroviral vector. Retroviral vectors 
are gene transfer plasmids that have an expression 
construct containing an exogenous nucleic acid residing 
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between two retroviral LTRs . Retroviral vectors typically 
contain appropriate packaging signals (e.g., a retroviral 
psi (*¥) packaging signal) , elements which enable invention 
construct integration into a host cell (e.g., a 5' and/or 
5 a 3 ' long terminal repeat (LTR) ) , and/or genes encoding 
proteins required for retroviral packaging (e.g., the pol 
gene, the gag gene and the env gene) , that enable the 
retroviral vector, RNA transcribed using the retroviral 
vector as a template, to be packaged into a viral virion in 
10 an appropriate packaging cell line (see, e.g., U.S. Patent 
4, 650, 764) . 

Suitable retroviral vectors for use herein are 
described, for example, in U.S. Patents 5,399,346 and 
5,252,479; and in WIPO publications WO 92/07573, WO 

15 90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, each of 
which is hereby incorporated herein by reference, in their 
entirety. These documents provide a description of methods 
for efficiently introducing nucleic acids into human cells 
using such retroviral vectors. Other retroviral vectors 

2 0 include, for example, mouse mammary tumor virus vectors 
(e.g., Shackleford et al. , (1988) PNAS, USA, 85:9655-9659), 
human immunodeficiency virus (e.g., Naldini et al . (1996) 
Science 272:165-320), and the like. 

Various procedures are also well-known in the art for 
25 providing helper cells which produce retroviral vector 
particles which are essentially free of replicating virus. 
See, for example, U.S. Patent 4,650,764; Miller, Human Gene 
Therapy, 1:5-14 (1990); Markowitz, et al . , Journal of 
Virology, 61 (4) : 1120-1124 (1988); Watanabe, et al . , 
30 Molecular and Cellular Biology, 3 (12) : 2241-2249 (1983); 
Danos, et al . , PNAS, 85:6460-6464 (1988); and Bosselman, et 
al., Molecular and Cellular Biology, 7 (5) : 1797-1806 (1987), 
which disclose procedures for producing viral vectors and 
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helper cells which minimize the chances for producing a 
viral vector which includes a replicating virus. 



Recombinant retroviruses suitable for carrying out the 
invention methods are produced employing well-known methods 
5 for producing retroviral virions. See, for example, U.S. 
Patent 4,650,764; Miller, Human Gene Therapy, 1:5-14 
(1990); Markowitz, et al . , Journal of Virology, 61(4):1120- 
1124 (1988); Watanabe, et al . , Molecular and Cellular 
Biology, 3 ( 12 ): 2241-2249 (1983); Danos , et al . , PNAS, 
10 85:6460-6464 (1988); and Bosselman, et al . , Molecular and 
Cellular Biology, 7(5) :1797-1806 (1987) . 

Thus, in one embodiment, a recombinant retroviral 
vector can be utilized to express receptor peptide. 
Preferably, the retroviral vector will further comprise a 

15 regulatory element and exogenous nucleic acid under the 
control of the regulatory element. Optionally, the 
retroviral vector can express an antibiotic resistance gene 
(see Figure 1C) . A "covector" can also be utilized to 
provide a nucleic acid construct comprising the promoter, 

20 the regulatory element and exogenous nucleic acid and a 
second antibiotic resistance gene. The co-vector carrying 
exogenous nucleic acid also has LTRs modified to promote 
high-level expression of exogenous nucleic acid (s) only in 
the presence of the receptor peptide encoded by the 

25 recombinant retrovirus and exogenous ligand therefor. Co- 
infected primary mammalian cells can then be selected using 
both antibiotics, resulting in a cell population that is 
dependent on ligand for high-level expression of the 
exogenous nucleic acid. 

3 0 The use of invention nucleic acid constructs allows 

the stable integration and expression of an exogenous 
nucleic acid(s) or transgene into a wide variety of 
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cultured cell types. In accordance with another embodiment 
of the present invention, there are provided recombinant 
cells containing invention nucleic acid constructs encoding 
receptor peptides as described herein, optionally further 
5 containing regulatory elements operatively associated with 
exogenous nucleic acid(s) . Alternatively, recombinant 
cells constructs comprising a regulatory element, such as 
the bombyx mori receptor response element, operatively 
associated with exogenous nucleic acids, optionally further 
10 containing invention receptor peptides. 

The amount of exogenous nucleic acid introduced into 
a host can be varied by those of skill in the art. For 
example, when a viral vector is employed to achieve gene 
transfer, the amount of nucleic acid introduced can be 
15 increased by increasing the amount of plaque forming units 
(PFU) of the viral vector. 

Suitable means for introducing (transducing) 
expression vectors containing invention nucleic acid 
constructs into host cells to produce transduced 

20 recombinant cells (i.e., cells containing recombinant 
heterologous nucleic acid) are well-known in the art (see, 
for review, Friedmann, (1989) Science, 244:1275-1281; 
Mulligan, (1993) Science, 260:926-932, each of which are 
incorporated herein by reference in their entirety) . 

25 Exemplary methods of transduction include, e.g., infection 
employing viral vectors (see, e.g., U.S. Patent 4,4 05,712 
and 4,650,764), calcium phosphate transfection (U.S. 
Patents 4,399,216 and 4,634,665), dextran sulfate 
transfection, electroporation, lipofection (see, e.g., U.S. 

30 Patents 4,394,448 and 4,619,794), cytofection, particle 
bead bombardment, and the like. The transduced nucleic 
acid can optionally include sequences which allow for its 
extrachromosomal (i.e., episomal) maintenance, or the 
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transduced nucleic acid can be donor nucleic acid that 
integrates into the genome of the host . 

Exemplary eukaryotic cells suitable for introducing 
invention expression vectors include, e.g., CV-1 cells, P19 
5 cells and NT2/D1 cells (which are derived from human embryo 
carcinomas) , ES cells (embryonic stem cells) , COS cells, 
mouse L cells, Chinese hamster ovary (CHO) cells, primary 
human fibroblast cells, human embryonic kidney cells, 
African green monkey cells, HEK 293 (ATCC accession #CRL 

10 1573; U.S. Patent No. 5,024,939), Ltk" cells (ATCC accession 
#CCL1.3), COS-7 cells (ATCC under accession #CRL 1651), 
DG44 cells (dhfr - CHO cells; see, e.g., Urlaub et al . (1986) 
Cell. Molec. Genet. 12: 555), cultured primary tissues, 
cultured tumor cells, neuronal progenitor or precursor 

15 cells, such as hcn/v-myc (a tetracycline sensitive 
retroviral vector selected by G418 resistance) , neuronal 
cells lines such as cerebellum derived neuronal precursors 
and PC12 cells, neurons, primary astrocytes, 
oligodendrocytes, and the like. Presently preferred cells 

20 include CV-1 and 293 cells. 



Invention nucleic acid constructs may be stably 
incorporated into cells or may be transiently expressed 
using methods known in the art. Cells are cultivated under 
growth conditions (as opposed to protein expression 

25 conditions) until a desired density is achieved. Stably 
transfected mammalian cells may be prepared by transfecting 
cells with an expression vector having a selectable marker 
gene (such as, for example, the gene for thymidine kinase, 
dihydrof olate reductase, neomycin resistance, and the 

30 like) , and growing the transfected cells under conditions 
selective for cells expressing the marker gene. To prepare 
transient transf ectants , mammalian cells are transfected 
with a reporter gene (such as the E. coli S-galactosidase 
gene) to monitor transf ection efficiency. Selectable 
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marker genes are typically not included in the transient 
transf ections because the transf ectants are typically not 
grown under selective conditions, and are usually analyzed 
within a few days after transf ection. 

5 In accordance with a still further embodiment of the 

present invention, there are provided transgenic animals 
and methods for producing transgenic animals capable of 
prolonged and regulated expression of exogenous nucleic 
acid(s), said method comprising introducing into early- 
10 stage embryos or stem cells: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
15 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
20 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain . 

As used herein, the phrase "transgenic animal" refers 

2 5 to an animal that contains one or more inheritable 

expression constructs containing one or more exogenous 
nucleic acid(s) under the transcription control of an 
operator and/or hormone response element as described 
herein. 

3 0 Methods of making transgenic animals using a 

particular nucleic acid construct are well-known in the 
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art. When preparing invention transgenic animals, it is 
preferred that two transgenic lines are generated. The 
first line will express, for example, a receptor peptide as 
described above (e.g., VbR) . Tissue specificity is 
5 conferred by the selection of a tissue-specific promoter 
(e.g., T-cell specific) that will direct expression of the 
receptor peptide to appropriate tissue. A second line 
contains a nucleic acid construct comprising a promoter and 
exogenous nucleic acid under the control of a regulatory 
10 element. Preferably, transgenic animals are produced by 
the transf ection/inf ection of embryonic stem cells which 
are employed to produce invention transgenic animals 
employing methods known to those of skill in art. 

In a presently preferred embodiment, an invention 
15 transgenic animal contains one or more expression 
constructs containing nucleic acid encoding receptor 
peptide and exogenous nucleic acid under the transcription 
control of a regulatory element. Thus, with tissue 
specific expression of the receptor peptide as described 
20 above and timely hormone treatment, inducible gene 
expression can be achieved with spatial, dosage, and 
temporal specificity. 

In yet another embodiment of the present invention, 
there are provided methods of inducing the transcription of 
25 an exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
30 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
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superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 
5 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 



(iii) said ligand for said ligand binding domain, 

10 wherein said ligand is not normally present 

in the cells of said host; 



said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide . 

15 in accordance with yet another embodiment of the 

present invention, there are provided methods for the 
expression of recombinant products detrimental to a host 
organism, said method comprising: 



transforming suitable host cells with: 

2 0 (i) a nucleic acid construct comprising a promoter 

and exogenous nucleic acid(s) which express said 
recombinant product under the control of a 
regulatory element; wherein said regulatory 
element is not normally present in the cells of 
25 said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 

3 0 superfamily which is not normally present in the 

cells of said host, wherein said receptor peptide 
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activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain, and 

5 growing said host cells to the desired level in the 

substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
introducing into said host cells a ligand, which, in 
10 combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

Recombinant products detrimental to a host organism 
contemplated for expression in accordance with the present 
invention include any gene product that functions to confer 

15 a toxic effect on the organism* For example, inducible 
expression of a toxin such as the diptheroid toxin would 
allow for specific ablation of tissue (Ross et al . Genes 
and Development 7:1318-1324 (1993)). Moreover, the 
numerous gene products that are known to induce apoptosis 

20 in cells expressing such products are contemplated for use 
herein (see, e.g, Apoptosis , The Molecular Basis of Cell 
Death, Current Communications In Cell & Molecular Biology, 
Cold Spring Harbor Laboratory Press, 1991) . For example, 
high level expression of 7 9 -amino acid polyglutamine tracts 

25 in cells results in apoptosis and rapid cell death. 

In accordance with still another embodiment of the 
present invention, there are provided nucleic acid 
constructs and cell lines which express a polyglutamine 
expression cassette (PQEC) under the control of an 
3 0 exogenous ligand to maximize obtaining stable 
transf ected/inf ected cultured cells which do not undergo 
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undesired apoptosis. Expression of polyglutamine tracts 
separately from the huntingtin protein facilitates 
identification of the toxic properties associated with the 
polyglutamine tracts themselves. In another embodiment of 
5 the present invention, there are provided methods for 
employing these cell lines to identify novel proteins that 
could target developing intracellular aggregates (IAs) and 
either block subsequent growth or facilitate or mark the 
IAs for degradation. 

10 In accordance with still another embodiment of the 

present invention, there are provided methods for 
modulating the transcription of nucleic acid(s) in an in 
vitro system, said method comprising administering to said 
system an amount of a ligand effective to modulate the 

15 transcription of said nucleic acid(s) ; wherein said ligand 
is not normally present in said cellular system; wherein 
said system comprises: 

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 

2 0 regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 

25 super family which is not normally present in the 

cells of said host, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 

3 0 domain, 



In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
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treatment of a host in need of gene therapy, said method 
comprising : 

introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter 
5 and said exogenous nucleic acid(s) under the 

control of a regulatory element; 



(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
10 mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 
15 activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain . 

administering, to said host, ligand for said ligand 
20 binding domain. 



As used herein, the term "in vivo delivery" refers to 
delivery of biological materials by such routes of 
administration as oral, intravenous, subcutaneous, 
intraperitoneal , intrathecal , intramuscular, intracranial , 
25 inhalational, topical, transdermal, suppository (rectal) , 
pessary (vaginal), and the like. 

As employed herein, the term "ligand" (or ligand 
precursor) refers to a non-steroidal substance or compound 
which, in its native state (or after conversion to its 
3 0 "active" form) , binds to the receptor peptide, thereby 
creating a ligand/receptor peptide complex, which in turn 
can bind an appropriate response element and activate 
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transcription therefrom. Ligands function to modulate 
transcription of nucleic acid(s) maintained under the 
control of a response element. In accordance with one 
aspect of the present invention, unless and until a 
5 suitable ligand is administered to the host, substantially 
no transcription of the desired exogenous nucleic acids 
occurs . 

Preferred ligands contemplated for use in the practice 
of the present invention are ligands characterized as not 
10 normally present in the cells of the host to be treated. 
Such ligands are referred to as being exogenous to the 
host. An example of a class of ligands not naturally 
present in mammalian systems are compounds referred to as 
hydrazines, preferably diacyl hydrazines. 

15 Hydrazines contemplated for use in the present 

invention include compounds which are readily available and 
are/or relatively inexpensive to manufacture. One such 
compound, tebuf enozide, is a non-steroidal ecdysone agonist 
which is used commercially as an insecticide. This 

2 0 compound specifically targets lepidopteran species, 

including bonibyx mori. Tebuf enozide has undergone 
extensive testing in animal hosts and has proved to be of 
very low toxicity to mammals and other non-insect species. 

Exemplary hydrazines contemplated for use herein are 
25 mimics of the naturally occurring ecdysones, i.e., 
synthetic organic compounds which have transactivation 
activity characteristic of the naturally occurring 
ecdysones. Examples of such compounds, referred to herein 
as ecdysone mimics, include 1,2-diacyl hydrazines (e.g., 

3 0 tebuf enozide and others described in U.S. Patent Nos. 

5,424,333 and 5,354,762, the entire contents of each of 
which are hereby incorporated by reference herein) , 
N' -substituted-N,N* -di-substituted hydrazines (e.g., those 
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described in U.S. Patent No. 5,117,057, the entire contents 
of which are hereby incorporated by reference herein) , 
dibenzoylalkyl cyanohydrazines (e.g., those described in 
European Application No. 461,809, the entire contents of 
5 which are hereby incorporated by reference herein) , 
N-substituted-N-alkyl-N,N' -diaroyl hydrazines (e.g., those 
described in U.S. Patent No. 5,225,443, the entire contents 
of which are hereby incorporated by reference herein) , 
N-substituted-N-acyl-N-alkyl, carbonyl hydrazines (e.g. , 

10 those described in European Application No. 234,944, the 
entire contents of which are hereby incorporated by 
reference herein) , N-aroyl-N' -alkyl-N' -aroyl hydrazines 
(e.g., those described in U.S. Patent No. 4,985,461, the 
entire contents of which are hereby incorporated by 

15 reference herein), and the like. 

Since it has been previously reported that the above- 
described diacyl hydrazines are neither toxic, teratogenic, 
nor known to affect mammalian physiology, they are ideal 
candidates for use as inducers in cultured cells and 

2 0 transgenic mammals according to invention methods. 

Ligands are administered in a manner compatible with 
the route of administration, the dosage formulation, and in 
a therapeutically effective amount. The required dosage 
will vary with the particular treatment desired, the degree 
25 and duration of therapeutic effect desired, the judgment of 
the practitioner, as well as properties peculiar to each 
individual. Moreover, suitable dosage ranges for systemic 
application depend on the route of administration. It is 
anticipated that dosages between about 10 micrograms and 

3 0 about 1 milligram per kilogram of body weight per day will 

be used for therapeutic treatment. 

An effective amount of ligand contemplated for use in 
the practice of the present invention is the amount of 
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ligand (e.g., diacyl hydrazine (s) ) required to achieve the 
desired level of transcription and/or translation of 
exogenous nucleic acid. A therapeutically effective amount 
is typically an amount of a ligand or ligand precursor 
5 that, when administered in a physiologically acceptable 
composition, is sufficient to achieve a plasma 
concentration of the transcribed or expressed nucleic acid 
product from about 0.1 /zg/ml to about 100 /zg/ml, preferably 
from about 1.0 /xg/ml to about 50 /zg/ml, more preferably at 
10 least about 2 ftg/ml and usually 5 to 10 /xg/ml. 

Ligand can be administered in a variety of ways, as 
are well-known in the art, i.e., by any means that produces 
contact between ligand and receptor peptide. For example, 
such ligands can be administered topically, orally, 

15 intravenously, intraperitoneally , intravascularly, and the 
like. The administration can be by any conventional means 
available for use in conjunction with pharmaceuticals, 
e.g., by intravenous injection; either as individual 
therapeutically active ingredients or in a combination with 

20 other therapeutically active ingredients. Ligands 
contemplated for use in the practice of the present 
invention can be administered alone, but are generally 
administered with a pharmaceutical carrier selected on the 
basis of the chosen route of administration and standard 

25 pharmaceutical practice. 

Therapeutic compositions containing suitable ligand 
are preferably administered intravenously, as by injection 
of a unit dose, for example. The term "unit dose," when 
used in reference to a therapeutic composition of the 
3 0 present invention, refers to a quantity of ligand suitable 
as unitary dosage for the subject, each unit containing a 
predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with 
the required diluent, i.e., carrier, or vehicle. It may be 
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particularly advantageous to administer such compounds in 
depot or long- lasting form as discussed hereinafter. 

Suitable regimes for initial administration and 
booster shots are variable, but are typified by an initial 
5 administration followed by repeated doses at one or more 
intervals by a subsequent injection or other 
administration . Alternatively, continuous intravenous 
infusion sufficient to maintain concentrations in the blood 
in the ranges specified for in vivo therapies are 
10 contemplated. 

In accordance with still another embodiment of the 
present invention, there are provided methods for the 
treatment of a host in need of gene therapy, said method 
comprising: 

15 introducing into cells obtained from said host: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) under the control 
of a regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
20 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
25 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 

to provide modified cells; 
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reintroducing the modified cells into said host; and 
administering, to said host, ligand for said ligand 
binding domain. 

In accordance with this embodiment of the present 
5 invention, the exogenous nucleic acid is introduced 
directly into cells obtained from a donor (host or separate 
donor) . Such cells are then implanted within the host. In 
a presently preferred embodiment, the transplanted cells 
are autologous with respect to the host. Autologous means 
10 that the donor and recipient of the cells are one and the 
same . 

The concept of gene replacement therapy for humans 
involves the introduction of functionally active "wild 
type" or "therapeutic" nucleic acids into the somatic cells 
15 of an affected host to correct a gene defect or deficiency. 
However, in order for gene replacement therapy to be 
effective, it must be possible to control the time and 
location at which gene expression occurs. 

Genes that encode useful "gene therapy" proteins that 
20 are not normally transported outside the cell can be used 
in the invention if such genes are "functionally appended" 
to a signal sequence that can "transport" the encoded 
product across the cell membrane. A variety of such signal 
sequences are known and can be used by those skilled in the 
25 art without undue experimentation. 

The invention will now be described in greater detail 
by reference to the following non-limiting examples. 
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Example 1 

Construction of Lepidopteran-derived fusion receptor 

peptide 

The VP16 t -domain (activation domain) is fused in 
5 frame onto bombyx mori- derived nuclear receptor (bR) at an 
internal Mlul site near the N- terminus (VbR, Figure 1A) . 
An AscI site immediately downstream of the ATG start codon 
allows multimerization of VP16 domains for VbRs with 
multiple VP16 activation domains. Note that unlike 

10 Drosophila ecdysone receptor (dEcR) , bR has virtually no 
C-terminal domain downstream of the LBD. Several amino 
acids of the hormone binding domain are replaced with 
sequences from the retinoic acid receptor or the thyroid 
hormone receptor (VbRRA and VbRTR, respectively) (Figure 

15 IB) . These mutant variations have properties slightly 
different from VbR, and in addition have silent mutations 
that eliminate Bell and BstBl sites in VbR. These are 
useful in the construction of many VbR-based retroviral 
constructs . 

20 Example 2 

Construction of multifunctional regulato ry proteins 

The small size and simplicity of the VbR system lends 
itself to the development of multifunctional regulatory 
proteins which are functional on both hormone receptor and 

25 TTA responsive promoters. VbR fusion to the tetracycline 
transact ivator results in a hybrid protein called TTMT (for 
Tebufenozide/Tetracycline Modulated Transactivator ; see 
Figure 1C) . This protein functions jointly as a 
ligand-modulated regulator of gene expression from both 

30 tetO and EcRE (ecdysone response element ) -containing 
promoters either separately or simultaneously. This 
compact protein, encoded in approximately 3 kb, confers 
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constitutive activation of promoters containing 
tetracycline operators (tetOs) in the absence of 
tetracycline/analogs . In the presence of tet analogs, the 
tetO-binding function of TTMT is blocked, deactivating 
5 responsive promoters. The VbR half of the chimeric protein 
is constitutively inactive on EcRE- containing promoters, 
but may be stimulated to transactivate responsive promoters 
to a high level by the addition of muristerone A (murA) or 
tebufenozide (teb) . Through such a dual system, two 

10 separate responsive promoters may be simultaneously 
regulated by two separate ligands through a single protein. 
Additional variants using the reverse-Tet-transactivator , 
other VbR variants, novel response elements, and other 
regulatory proteins, will presumably result in further 

15 customized variations on this same theme. 

A single transgenic animal generated with the TTMT 
protein would be simultaneously responsive to transgene 
regulation of two separate transgenes by two separate 
ligands. Further, individual promoters can be produced 
20 that respond to both halves of the protein, allowing the 
transgene to be regulated by both ligands, at the 
discretion of the investigator. 

Example 3 
Transient transfection assay 

25 Transient transfection experiments are carried out in 

CV-1 cells or 293 cells using modified drosophila ecdysone 
receptor (VdEcR) or modified bombyx hormone receptor (VbR) 
with or without dimer partners. Luc activity is examined 
40 hours after transfection and stimulation with ligand 

30 (Figure 2A) . Both receptors are assayed in CV-1 cells with 
1 /zM MurA and no exogenous dimer partner. Note that bR 
responds, while dEcR does not. Both receptors are also 
assayed in the presence of 5 /zg/ml tebufenozide (Figure 
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2B) . Note that VbR responds with greater than 200-fold 
induction while drosophila EcR does not respond at all, and 
in fact, decreases slightly. Both receptors are also 
assayed with and without dimer partners. VbR is stimulated 
5 with 5 ng/ml tebufenozide and VdEcR is stimulated with 1 jzM 
MurA. Note that RXR and Usp are necessary for VdEcR while 
VbR is constitutively activated by the presence of 
exogenous dimer partner and is inhibited by ligand 
addition. VbR, with no exogenous dimer partner, has an 
10 extremely low base-line which is stimulated 200-300 fold by 
ligand. Note also from this experiment that VdEcR tends to 
have a much higher baseline than VbR without dimer partner. 



A side-by side comparison of 2.5 fig/ml teb and 1 /iM 
MurA on VbR show both ligands are effective at stimulating 

15 VbR. Tebufenozide is administered to cells co-transf ected 
with nucleic acids encoding receptors: LINX (which encodes 
the tet-transactivator (TTA) ) , VbR, TTMT (which encodes the 
TTA-VbR fusion protein, and TTMT- 2 V which is like TTMT, but 
with two VP16 t -domains) to assay for EcRE-Luc activity 

20 (Figure 2C) . Note that teb stimulates VbR, and the TTMT 
fusions, but has no action on TTA (LINX) . The same 
experiment as that described for 2C, only with a TetO-Luc 
reporter, which responds only to TTA or fusion proteins is 
summarized in Figure 2D. Doxycycline (dox) acts to block 

25 constitutive -activity of TTA. Note that in this 
experiment, TTMT and TTMT- 2 V work better than TTA only 
(LINX) to stimulate the TetO-luc promoter/reporter. They 
are also efficiently blocked by dox. An experiment on CV-1 
cells transf ected with TTMT and two reporters (6TetO-luc 

30 and 4 -EcRE- Pgalactosidase) shows both promoters could be 
simultaneously influenced by the presence or absence of 
both ligands (Figure 2E) . 
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Example 4 

Ligand mediated transaene regulation 

Transgene regulation by teb or MurA in stably infected 
rat fibroblasts is assayed utilizing recombinant 
5 retroviral constructs as the reporter, and VbR encoding 
retroviral vectors to provide the receptor. Bulk infected 
selected populations stimulated with 2.5 ixg/ml teb or 1 jxM 
MurA indicate b-gal or tyrosine hydroxylase infected cells. 
Quantitative analysis of an individual VbR-luc clonal 
10 fibroblast line, and a VbR-GH clonal line show that 0.25 
/xg/ml teb is as effective at stimulating full activity as 
higher levels (Figure 3) . 

Example 5 

Sinale-p lasmid retroviral constructs 

15 The plasmid, pBO, utilizes a responsive internal 

promoter to regulate transgene expression and also contains 
a VbR- responsive 3 1 LTR to autoregulate the VbR receptor 
itself (Figure 4) . The plasmid, pNA, is constructing using 
a 3 ' -responsive LTR to regulate transgene expression. pBO 

2 0 utilizes an EcR responsive internal promoter and a TTA 
responsive 3 f LTR. Both are host to regulation by the TTMT 
protein, simultaneously. The purpose of this design is to 
provide chronic stimulation of the duplicated 5' LTR along 
with regulated expression of the internal EcR responsive 

25 promoter. By removing all of the native promoting 
sequences from the 6To LTR with the exception of the 
TetO's, variants of this construct are also inactivatable . 

Example 6 
Construction of plasmid vectors 

30 VbR and TTMT systems including constitutive and 

inducible promoters are integrated into a modular plasmid 
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vector known as "pBW" (Figure 5) . The purpose of pBW is to 
integrate all of the elements of bombyx receptor- or 
tetracycline-analog responsiveness into a single plasmid 
vector destined for use in producing transgenic animals by 
5 methods such as pronuclear injection. pBW is to facilitate 
construction of transgenic animals or stably transfected 
cells using VbR variants. The pBW design simplifies the 
insertion of responsive promoters, transgenes, constitutive 
promoters, and the various VbRs, TTMTs, and related 
10 regulatory proteins. With pBW, the transgene regulatory 
properties of generated animals should be more predictable 
and reproducible from founder to founder than multi -vector 
systems . 



There are two major components of this system: the 
15 first is a tissue-specific expression cassette to produce 
the receptor and/or antibiotic resistance genes. The 
second is an expression cassette with a responsive promoter 
and a transgene. Polyadenylation signals flank all of the 
expression cassettes to provide efficient polyadenylation 
20 (p(A)) of transcribed RNAs. The use of rare restriction 
endonuclease sites within strategic locations and 
polylinkers of pBW will ease cloning of transgenes. An 
adjunct shuttle vector derived from pBSK (Stratagene) 
called SKSP contains rare sites flanking the polylinker and 
25 compatible with pBW. pBW also shares a number of sites 
with pBO to further simplify construction. pBW is 
constructed within the plasmid pcDNA3 (Invitrogen) . 

Example 7 
Regulating transgene 
3 0 expression in target cells in vitro 



Individual desired elements can be inserted 
sequentially into a recombinant retrovirus. Recombinant 
retrovirus starts as a polylinker composed of 5 1 Notl-Mlul- 
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NruI-EcoRI-AscI-Pmll-BstBI-BarnKI-Hindlll-Hpal-Clal-Nsil- 
Kpnl 3' sites. The retroviral y gin (see Adam and Miller 
in .7. Virol. 62:3802 (1988) and Barklis et al . , in Cell 
47:391 (1986)) is inserted into sites of the recombinant 
5 retroviral polyl inker. The internal CMV promoter is 
inserted BamHI-Clal into the BanHI-Clal sites of the 
evolving recombinant retroviral vector. All LTRs destined 
for insertion into the 5' location are produced by low- 
cycle/high-fidelity PCR production and end primers with 

10 compatible Notl-Mlul sites for insertion into the Notl-Mlul 
sites. 3' LTRs are generated with Clal-Nsil compatible 
ends for insertion into these sites. The R region of the 
3' LTR is included in the downstream primer sequence. 
Transgenes, receptor cDNAs , or selectable marker genes are 

15 inserted into the remaining polyl inker from the KcoRI site 
through the BarriHI site, or the remaining polylinker after 
the internal promoters (ffindlll-Clal) . 

The resulting vectors are characterized by restriction 
digests and mapping, amplified by large-scale plasmid 

20 preparation, and prepared for transfection into packaging 
cells by standard methods. Transient retroviral production 
in 293 and 293T cells has been previously described (see 
Pear et al., in PNAS 90:8392 (1993)). Forty-eight hours 
after transient retroviral production, the conditioned 

25 media is removed, filtered through 0.45 mm filters, and 
frozen at -70°C until use. 10-cm dishes of primary-cultured 
Fisher rat abdominal fibroblasts (Rosenberg et al., Science 
242:1575 (1988)) at approximately 50% density are infected 
with l/io volume of virus-containing media and 8 /xg 

30 polybrene/ml for 48 hours. Infected cells are then 
selected under the appropriate antibiotic until non- 
infected cells are cleared from the population 
(approximately 10 days) . Resistant colonies are 

trypsinized, pooled and passaged. The resulting population 
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is then infected with the second virus, and reselected 
again in media containing both antibiotics. The infection 
and selection process for the second round of viral 
infection proceeds as described for the first round. The 
5 final population of doubly resistant primary fibroblasts 
are then pooled and passaged to 6 or 24 -well Costar plates 
for assay in vitro. 

The short (24 hour) course is performed in 24-well 
Costar dishes with cells at approximately 80% confluency. 

10 5 /xl of ligand (final concentration of 1 fM) or vehicle 
(20% EtOH) is added to the 1 ml of media in each well. 
Eight hours after first ligand administration, the cells 
are all washed in PBS and the media is replaced. (3- 
galactosidase histochemical reaction is performed on 1.5% 

15 glutaraldehyde- fixed cells essentially as described by 
Shimohama et al. in Brain Res Mol Brain Res 5:271 (1989) in 
a 37°C environment for 2 hours. The 4 -day time course is 
performed in an identical fashion, except that the cell 
population is plated at only 20% confluency at the start of 

20 the experiments (and grow to near-total confluency by the 
end of the 4 day period) and neither ligand nor media is 
changed or replaced. 

For long-term culture experiments, MMBG fibroblasts 
are plated in triplicate at high density in 24-well plates. 

25 Within 48-72 hours of plating, when the cells reach 100% 
confluency within the wells, 1 /xM ligand or vehicle is 
added to one group of plates, while others are left 
completely untreated. The wells are then placed in a 37°C- 
10% C0 2 environment for the next 25 days without any media 

30 changes. On day 23, the plates that had received neither 
vehicle nor ligand on day 1 are removed and half of the 
wells are given vehicle and the other half are given 1 fiH 
ligand for 4 0 hours. 
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After 25 days of nutrient deprivation and contact 
inhibition, all of the plates are processed for Pgal 
histochemical staining as above. 



MMGH fibroblasts are produced by infecting a 
5 fibroblast line with the recombinant retroviral vectors 
containing desired elements. Cells are selected under both 
antibiotics as described previously. After preliminary 
examination of ligand- induced hGH production in the bulk 
population, the MMGH population is plated at high density 

10 in 6-well Costar plates. Forty-eight hours after the 
initial plating and when the cells are essentially 
completely growth inhibited by contact, the medium of all 
plates is replaced with DME containing 2% FBS. Under these 
conditions, primary rat fibroblasts stop dividing even if 

15 they are not confluent, and settle out into a distended 
morphology with prominent nuclei characteristic of severely 
growth-arrested fibroblasts. Six days after 

acclimatization to this culture environment, 1 ml of medium 
is harvested from each well and plate of the experiment and 

20 stored frozen at -20°C. The remaining unused media is 
discarded at the same time of harvest and replaced with 
fresh DME-2% FBS. Groups of wells are treated or not with 
1 fiM ligand. Hormonally treated wells are washed with 
medium to remove as much residual ligand as possible prior 

2 5 to medium replacement when ligand treatment is 

discontinued. This routine is repeated daily for the 21- 
day extent of the experiment . 

After all time-point samples have been collected and 
frozen at -20°C / they are simultaneously processed for the 

3 0 presence of the hGH protein by ELISA (Boehringer Mannheim) 

following the protocol recommended by the manufacturer. 
ELISA data are quant itated on an MR700 microplate reader 
(Dynatech Laboratories, Chant illy, VA) and compared to a 
standard curve generated using purified hGH to determine 
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picogram GH amounts. Control wells are TH (tyrosine 
hydroxylase) producing fibroblasts, but display all of the 
characteristics of the recombinant retrovirus except for 
the hGH transgene and secreted protein. 

5 Example 8 

Expression Vectors and Tra nsient Transfection Assays 

All receptors were subcloned into vector LNCX (A.D. 
Miller, GenBank accession no. M28247, with an extended 
polylinker) for use in transient transfection assays. VBR 

10 (or CVBR) when referring to the retrovirus) was produced by 
insertion of VP16 (14) sequences with a synthetic ATG start 
codon into the amino-terminal region of BE up to, and 
in- frame with, the Mlul site (corresponding to amino acid 
26). VP1 6 primers: (5') 5 ' -GAGAGAAGCTTATGGCGCGCCCGACCGATG 

15 ( SEQ ID NO : 5 ) and ( 3 ' ) 5 ' - C ACACACGC GTGTACT CGT CAATTC CAAG 
(SEQ ID NO:6). VDE (or CVDE) was produced in a similar 
fashion by fusing VP16 sequences in- frame at a novel Ncol 
site corresponding to amino acid 68. The downstream primer 
was made Ncol compatible. Plasmid template (100 ng) , each 

20 primer (500 ng) , and reaction conditions outlined by the 
manufacturer for Pwo (Boehringer Mannheim) high fidelity 
polymerase, were used with a program of 1 min at 94 °C, 
1 min at 45°C, and 1 min at 72°C for 20 cycles for 
production of all PCR products used herein. LNCX-Usp and 

25 RXR were produced from EcoRI fragments encoding this 
complete cDNAs, filling with the Klenow fragment of DNA 
polymerase, and inserting the fragments into the tfpal site 
of LNCX. Orientation was determined by restriction 
endonuclease digestion. Transient transfection in CV-1 and 

3 0 293 cell was performed by using standard methods (Sambrook 
et al (1989) Molecular Cloning: A Laboratory Manual 2 nd 
edition) in triplicate in 24-well plates by calcium 
phosphate coprecipitation of 100-ng receptor (s), reporter 
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plasmid E4-luc, and pCHHO as an internal control. 
Briefly, E4-luc is four tandem EcREs inserted upstream of 
a thymidine kinase gene minimal promoter directing 
luciferase expression EcRE oligonucleotides were as 
5 described (Thomas et al . (1993) Nature 362:471-475) with 
BamHI-Bg/lI compatible ends. Ligand (1 /zM) was added at 
the time of transf ection, and 40 h later cells were lysed 
and extracts were used for (J-galactosidase assay and 
measurement of luciferase activity in an analystical 
10 bioluminescence photometer. 

CVDEiR and MS vectors used only as retroviruses in 
production of stable cell lines were produced as follows: 
The VDEiR expression cassette was constructed by modifying 
the ATG start codon of human RXRa to contain an overlapping 
15 BstXI site for fusion into the ATG start codon of the 
0.8-kb EMCV IRES (Jang et al (1989) J. Virol 63:1651-1660 
and Hoshimaru et al (1996) PNAS 93:1518-1523) sequence. 
The IRES-RXR cassette was then subcloned downstream of VDE 
in LNCX-ofVDE (CofVDE) , a VDE variant vector with lower 

2 0 basal transact ivat ion levels in superphysiological RXR 

environments, to produce CVDEiR. The MS retroviral 
reporter was produced by removing the Moloney murine 
leukemia virus enhancer core from the 3' long terminal 
repeat (LTR) with Nhel-Xbal digestion and replacing this 
25 sequence with four tandem EcREs, resulting in a reporter 
analogous to the E4-luc reporter plasmid. 

Example 9 
Gel Mobility Shift Assays 

Gel mobility shift analysis was performed by in vitro 

3 0 translation of receptor ORFs subcloned into pBSK 

(Stratagene) , pGEM-3 (Promega), and PSL3 01 (Invitrogen) in 
the presence of [ 35 S]Met by using T3/T7 TNT (Promega) 



WO 99/02683 



PCT/US98/14215 



62 

transcription/translation and following the manufacturer's 
protocol. In vitro translated proteins were qualitatively 
examined by 5% SDS/PAGE with protocols as described 
elsewhere (Sambrook et al (1989) Molecular Cloning: A 
5 Laboratory Manual 2 nd edition) and quantified by 
Phosphorlmager (Molecular Dynamics) exposures of dried 
gels. Amounts of proteins used in gel shift assays were 
normalized by using quantitative data and correction for 
predicted Met residues in individual constructs. 

10 Double -stranded EcRE probes corresponding to response 
elements described above were labeled by filling of the 
Klenow fragment of DNA polymerase with [ 32 P] dCTP and 
unlabeled dGAT by standard methods. Reaction conditions 
for protein-probe interaction and gel electrophoresis were 

15 essentially as described by Yao et aJ . ((1992) Cell 71:63- 
72) except, to facilitate comparison between samples, 
reaction mixtures (including dimer partners) and probe) 
were prepared as a mixture and distributed equally to 
individual tubes with receptor proteins. The reactions 

20 were allowed to proceed at 23 °C for 5 min. at which time 
ligand or vehicle was added and the reaction allowed to 
continue for an additional 20 min. 

Example 10 

Construction of bowbvx mori receptor (BE) - drosophila 
25 melanocraster ecdvsone receptor (DE) Chimeric Receptors 

Chimeric receptors were produced by taking advantage 
of conserved DNA sequences and sites within the receptors 
and by introducing compatible sites by low cycle, high 
fidelity PCR (described above) of bowbyx mori receptor 
30 (BE) and drosophila melanogaster ecdysone receptor (DE) 
templates to produce fusion proteins. DEBE-A/B was 
produced by PCR and replaced the BE /AB domain (amino acids 
1-197, as set forth in SEQ ID NO: 3) with amino acids 1-255 
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of DE introducing a novel Apal restriction site. DEBE-C 
replaced the A/B and C domains (amino acids 1-273, as set 
forth in SEQ ID NO: 3) of BE with amino acids 1-331 of the 
DE protein by introduction of a compatible Bagl site into 
5 BE at the fusion junction. DEBE-D replaced BE from amino 
acids 1-363, as set forth in SEQ ID NO: 3, with amino acids 
1-430 of DE by Kpnl digest. BEDE was the reverse of this 
construct and replaced amino acids 1-430 of DE with amino 
acids 1-363 (as set forth in SEQ ID NO:3) of BE by using 

10 Kpnl partial digest. DEBH was produced from DEBE-C by 
removing all BE sequences except the D domain by Kpnl 
digestion and inserting sequences encoding the intact E and 
F domains (amino acids 430-878) of the original DE after 
partial Kpnl digestion. VEH was produced by excision of 

15 the central region of DEBH, including the heterologous 
hinge domain, and inserted into VDE, replacing the 
analogous region. 

Example 11 

Construction of E Domain Chimeric Receptors 

20 The chimera, BEDB, was produced by insertion of the 

654 -bp KpnI-Bg/II fragment encoding precisely the DE 
hormone -binding domain into the identical analogous sites 
in BE. Two unique sites within the hormone -binding domain 
of BEDB were used in the production of the chimeras. A 

25 unique Aatll site lies approximately 120 bp downstream of 
the 5' JCpnl site, and a unique EagI sites lies 
approximately 440 bp downstream. PCR was used as described 
above, and corresponding fragments of BE with appropriate 
compatible ends were generated and subcloned into BEDB 

30 digested with Kpnl + Aatll (BKE) , Aatll + EagI (BAE) , Aatll 
+ Bg/II (BAB), and Eagl-Bg/II for BEB. The resulting 
constructs were translated, examined by SDS/PAGE, 
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quantified, and normalized as described above. Conditions 
for gel mobility shift assay were also as described above. 

Example 12 

Infection of Cells with Recombinant Retroviruses 

5 Transient restroviral production in 293 cell types has 

been described (Pear et al . ((1993) PNAS 90:8392-8396). 
Forty-eight hours after transient retroviral production, 
the conditioned medium was removed, filtered through 
0.45-ptm filters, and frozen at - 70°C until use. 

10 Ten-centimeter dishes of primary- cultured Fischer rat 
abdominal fibroblasts (Schinstine et al . (1992) Neurochem 
58:2019-2029) and 293 and CV-1 cells at approximately 50% 
density were infected at a multiplicity of infection of 
approximately 0.05 with virus -containing media and 8 mg/ml 

15 Polybrene for 48 h. Infected cells were then selected with 
4-6 mM L-histidinol until noninfected cells were cleared 
from the population: approximately 10 d for FF12s , 21 
dafor 293s, and >30 d for CV-ls (CV-1 cell populations were 
resistant to rigorous selection with i . -histidinol) . 

20 Resistant colonies from multiple plates were trypsinized, 
pooled, and passaged. The resulting bulk population was 
then split and infected with receptor-bearing virus and 
reselected again in medium containing both L-histidinol and 
400-800 /zg/ml G418 . The infection and selection process 

25 for the second round of viral infection proceeded as 
described for the first round. The final population of 
doubly resistant primary fibroblasts was then pooled and 
passaged in triplicate in 6- or 24 -well Costar plates for 
P-galactosidase staining by standard methods or lucif erase 

3 0 assay. 



WO 99/02683 



PCT/US98/14215 



65 

Example 13 

CV-1 Cells Support BE-Mediated Transact ivat ion Without 
Exogenous Dimer Partner 

Both receptors were tested to determine the effect 
5 that differential endogenous dimer partner availability had 
on transactivation of responsive promoters. Relative 
luciferase activity employing the reporter plasmid, E4-luc, 
were performed from VDE and VBR transiently cotransf ected 
into 293 or CV-1 cells with either Usp, RXR, or no added 
10 heterodimer partner. Luciferase activity was assayed in 
the presence of either vehicle, 1 /xM murA, or 1 jxM 
tebuf enozide . 

In the presence of an equimolar exogenously added 
dimer partner, VDE and VBR transactivation was similar in 

15 both cell types. With Usp, both proteins were induced less 
than 2-fold by ligand and displayed a high level of basal 
transactivation. With RXR, VDE displayed an average 
relative 5. 35- fold induction across both cell types, 
whereas VBR was induced only 2. 3 5- fold, even though the 

20 absolute level of induction matched or exceeded the 
expression with VDE. The decreased relative induction of 
VBR + RXR resulted from approximately doubled basal 
activity levels compared with VDE + RXR. With no exogenous 
dimer partner, both receptors in both cell types exhibited 

25 dramatic 15- to 80-fold decreases in basal transactivation. 

With no exogenous dimer partner, VDE failed to respond 
to tebuf enozide, whereas VBR continued to respond well to 
both murA and tebuf enozide . The addition of RXR to 
VDE-transf ected 293 cells increased the maximum 
3 0 murA- treated expression level by only 20%, indicating that 
the high level of endogenous RXR in 2 93 cells is near 
saturation for heterodimerization with DE. By comparison, 
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CV-1 cells supported murA stimulation of VDE at only 13% of 
the expression level of VDE with added RXR (Fig. IB) . VBR 
was active at levels similar to VDE in the high RXR 
background of the 293 cells; however, in the CV-1 cells, 
5 tebuf enozide-stimulated VBR exhibited both the highest 
absolute level of transactivation and the greatest relative 
induction (160.2-fold) of any of the other combinations of 
receptors and ligands tested. In the CV-1 cells, 
tebuf enozide-stimulated VBR displayed 21-fold greater 
10 relative induction and an absolute expression level 9.25 
times the level of VDE treated with ntutA. 

Example 14 

BE Displays Facilitated Heterodimer Formation with Usp 

and RXR 

15 The high level function of VBR in the low heterodimer 

partner environment of the CV-1 cells suggested that BE 
might be capable of efficient function with lower levels of 
dimer partner than DE. To explore this possibility, 
cell -free experiments with gel mobility shift analysis and 

20 equivalent amount of in vitro translated BE and DE in 
combination with Usp or RXR dimer partners and both ligands 
were performed. Neither protein alone formed a dimeric 
complex with the EcRE probe unless either the Usp or RXR 
dimer partner proteins were added. In the presence of Usp, 

25 DE formed a comparatively weak interaction with the EcREs 
that was increased approximately 4 -fold by murA and 2- fold 
by tebuf enozide . In contrast, BE produced a strong shift 
with Usp that was only slightly influenced by either 
ligand. In the absence of any ligand, BE bound probe 5 

30 times more efficiently than DE. With ligand, the BE probe 
shift was observed to still exceed the DE shift and was 
double the DE + Usp band volume. Both receptors combined 
with RXR displayed an absolute dependence on ligand for 
heterodimer formation; however, whereas BE + RXR with 



WO 99/02683 



PCT/US98/14215 



67 

either ligand displayed a prominent shifted band, an 
equivalent amount of DE resulted in a barely detectable 
shifted band with murA and no detectable band shift with 
tebufenozide. The BE + RXR tebuf enozide- and murA-induced 
shifts were determined to be >15 times the DE + RXR shift 
with murA. 



10 



15 



30 



Example 1 5 

BE-DE Chimeras Reveal That- h he D Domain Is Hr-iM^I rr, 
High Affdn-it- y Heterodimerizat.-i o n 



Gel mobility shift assays indicated that BE had higher 
affinity heterodimer formation than DE. To further define 
the subdomains and molecular determinants resulting in the 
BE high affinity phenotype, PCR mutagenesis and internal 
shared restriction endonuclease sites were used to produce 
BE-DE chimeric receptors that were assayed for their 
ability to bind the EcRE probe with either Usp or RXR and 
with or without ligand. Because previous results indicated 
that murA stimulated both EcRs, for purposes of direct 
comparison only murA was used for gel shift analysis of the 
20 chimeras. Fusion constructs began at the N terminus of BE 
and sequentially replaced the A/B (DEBE-A) , C (DEBE-C) , 
and D (DEBE-D) domains with DE sequences. BEDE was the 
reverse, encoding BE up to the D-E domain boundary with the 
DE E and F domains. DEBE-A and DEBE-C with Usp shifted 
25 nearly as efficiently as native BE. The absence of shift 
for DEBE-D, however, and the substantial shift seen for 
BEDE + Usp suggested that the high affinity determinant for 
BE-Usp heterodimerization resided in the D, or hinge 
domain. The same constructs in combination with RXR 
revealed a different shift pattern. BE and DEBE-A both 
responded strongly to ligand to bind and shift the probe. 
DEBE-C with ligand was approximately 40% decreased relative 
to native BE; however, it still displayed 16-fold greater 
binding than native DE. DEBE-D had lost all high affinity 
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DNA-binciing complex formation and functioned 
indistinguishably from the native DE with or without 
ligand. MurA-stimulated BEDE + RXR produced a band shift 
4 times the intensity of the native DE + RXR stimulated 
5 with murA, but BEDE was clearly significantly impaired to 
heterodimerization with RXR compared with native BE and 
other higher affinity chimeras like DEBE-A and DEBE-C. 

For further confirmation of the role of the BE D 
domain in dimer partner affinity, the D region of DE was 

10 replaced with the BE D domain to produce DEBH. Native DE 
and DEBH were compared side by side for binding to EcREs 
with both Usp and RXR dimer partners. Gel mobility shift 
of in vitro translated and normalized DE and 
hinge -substituted DEBH was performed, in the presence of 

15 either Usp or RXR, and further in the presence of vehicle 
or 1 iM murA. Compared with DE + Usp, DEBH + Usp averaged 
5- fold greater probe binding with murA and 9- fold greater 
binding without ligand. DEBH + RXR exhibited both 
decreased probe shift in the absence of ligand and 

20 increased probe binding with murA (approximately 3 -fold 
over native DE) for a 10-fold relative induction by ligand 
compared with 2.5-fold for DE. Transient transfection 

analysis of a VP16-DEBH construct (VEH) in CV-1 cells 
revealed that VEH shared characteristics of both DE and BE. 

25 VEH displayed the high basal transactivation level and 

low relative induction (average of 2 . 5 fold) characteristic 
of BE with superphysiological RXR. VEH displayed 5- to 
8 -fold greater basal and murA-activated expression than 
native DE in the absence of any exogenous dimer partner. 

30 MurA-stimulated VEH expression exceeded even the expression 
level for murA-stimulated BE by nearly 2-fold. As 
predicted, without the BE hormone -binding domain, DEBH did 
not to respond to tebuf enozide . Additional chimeric 
proteins with a subset for BED-region sequences did not 
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function as well as DEBH with replacement of the entire D 
domain, suggesting that multiple discrete determinants 
presumably over much of the hinge region contribute to the 
high affinity phenotype. 

5 Example 16 

Discrete Function al Regions Within the E Domain Revealed 

bv BE -DE Chimeras 

Although the preceding data reveal that the BED domain 
alone was sufficient for high affinity heterodimerization 

10 with Usp, a comparison of the relative shift of the DEBE-D 
and BEDE chimeras with both partners suggested that 
additional determinants within the BE hormone -binding E 
domain contributed to a high affinity interaction with RXR. 
To further study the BE E domain, chimeric proteins were 

15 produced by using BEDE as a template. Five E-domain 
chimeras named BEDB, BKE, BAE, BAB, and BEB were prepared. 
BEDB is identical with BE with the exception of complete 
replacement of the E domain with corresponding sequences 
from DE, BEDB was identical with BEDE with the exception of 

2 0 replacement of the large C- terminal DE F domain for the 

20-amino acid F domain of BE. BKE, BAE , BAB, and BEB are 
derived from BEDB and are sequential, C- terminal moving 
replacements of three regions within the DE-derived E 
domain. Gel mobility shift assays with BE, BEDE, and BEDB 
25 revealed that BEDE and BEDB are essentially identical in 
their properties of complex formation, suggesting that the 
F domain does not play a significant role in 
heterodimerization and DNA binding. In addition, BEDB 
recapitulates the high affinity USP binding and the low 

3 0 affinity RXR binding of BEDE. 

The BEDB-derived chimeric receptors reveal several 
functional subdomains over most of the BE E domain. By 
using unique internal sites, the E domain was subdivided 
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into approximately thirds called E x E 2 and E 3 . Treatments 
were with vehicle (-) , 1 /xM murA (M) , or 1 fiM tebufenozide 
(T) . Chimera BKE with replacement of the DE E 1 and E 2/ 
regions and chimera BAE with replacement of only the E 2 
5 region displayed very similar patterns of shift. Both 
chimeras were significantly impaired in complex formation 
with Usp and RXR relative to the original BEDB chimera, 
suggesting that there were fundamental incompatibilities 
between subdomains of the Bombyx and Drosophila E domains. 

10 The similarity of these two constructs suggested that DE 
and BE E x regions were likely to be functionally similar, 
but that the E 2 regions had very different properties. 
Curiously, although both basal and murA- stimulated 
heterodimerization were inhibited, BKE/BAE-Usp 

15 heterodimerization was significantly stimulated by 
tebufenozide, indicating that the tebufenozide response 
determinant is within E 2 but that the high affinity RXR- 
binding determinant lies elsewhere. 

When E 2 was combined in tandem with E 3 in the chimeric 
2 0 receptor BAB, high affinity heterodimer formation 
indistinguishable from native BE heterodimerization was 
observed with both Usp and RXR. To determine whether the 
E 3 region alone could confer high affinity 
heterodimerization with RXR, the chimera BEB, with 

2 5 substitution of only the E 3 region, was produced and tested. 

BEB displayed a pattern of response essentially identical 
with the original BEDB with low- level interaction with RXR 
and only upon stimulation with murA. BEB, which lacked the 
E 2 region of native BE, was not responsive to tebufenozide. 

3 0 Taken together, these results indicate that the BE D domain 

and E 3 region, in combination with ligand-binding 
determinants within the E 2 region, function in concert to 
produce high affinity, ligand- dependent heterodimerization 
between BE and the RXR protein. 
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Example 17 

Comparison of VBR and VDE Function in Stablv Transduced 

Cells 

Building on transient transfection and gel shift data, 
5 DE an BE variants were directly compared for their ability 
to function at low or single copy in stably transduced cell 
types. In addition to direct comparison of CVDE and CVBR 
retroviruses, an RXR-bearing VDE variant called CVDEiR was 
also tested that previous studies had determined to 

10 provide the best combination of low basal expression and 
relatively high induction for VDE cotransf erred with 
supplemental RXR. These three receptor-bearing vectors and 
an empty LNCX control were used to infect 293, CV-1, and 
primary fibroblast cells harboring an EcRE-responsive 

15 retroviruses termed MS. MS encoded a 3' LTR with the core 
enhancer elements replaced with four tandem EcREs and an 
internal simian virus 40 promoter vector directing 
resistance to the drug L-histidinol . Parallel MS vectors 
encoding either the LacZ gene (MS-Z) or a luciferase 

20 transgene (MS-luc) were used in this comparative study, the 
former as an indication of the number of responding cells 
and the latter for quantitative purposes. 

Three cell types, 293s, CV-Is, and rat primary 
fibroblasts (line FF12) , were selected for assay of 

25 reporter/receptor virus combinations based on their 
relative capacities to supports high level DE-mediated 
transactivation. The human 293 cell line support 
essentially full DE activation with no requirement for 
added RXR, whereas CV-ls require the contransf ection of 

30 exogenous dimer partner for DE function. FF12s were 
selected as the third cell type because they are refractory 
to chemical methods of transduction and reliant on the use 
of biological vectors such as recombinant retroviruses for 
efficient gene transfer. Furthermore, primary fibroblasts 



WO 99/02683 



PCT/US98/14215 



72 

are frequently used as autologous donor cells for 
transplantation approaches to somatic gene therapy. 

All three cell types were infected with both the 
reporter and receptor-bearing vectors and treated for 72 h 
5 with either vehicle, 1 fM murA or 1 /xM tebuf enozide . 
Histochemical staining of MS-Z infected cells indicated 
that all three receptor types were capable of high level 
ligand-responsive transactivation in coinfected 293 cells. 
DE-derived constructs responded well to murA but were 

10 unable to respond strongly to tebuf enozide . CVBR, on the 
other hand, responded moderately to murA but strongly to 
tebuf enozide. MS-LacZ cell types infected with an empty 
LNCX vector displayed no staining when stimulated with any 
ligand and further confirmed observations from 

15 vehicle-treated cells that the MS vector permitted only 
relatively low basal levels of transgene expression. 
Approximately 30% of CVDE-, CV-DEiR- , and CVBR-inf ected 
MS-Z 293 cells responded to murA. Tebuf enozide stimulated 
<1% of CVDE-infected cells and <4% of CVDEiR- infected MS-Z 

20 293s. More than 50% of CVBR-inf ected MS-Z 293s responded 
to tebuf enozide, however, exceeding all DE-based vectors 
with murA. More dramatic differences were seen in CV-1 and 
FF12 MS-Z infected cells. CVDE alone was not inducible in 
either cell type, indicating that the endogenous level of 

25 RXR in either CV-ls or primary fibroblasts is not 
sufficient to support VDE-mediated transactivation. When 
the endogenous level of RXR is supplemented, as in CVDEiR, 
3-10% of the infected population was observed to strongly 
respond to murA. CVBR, however, was clearly the most 

3 0 potent transact ivator in the CV-1 and FF12 cell types with 
20% of CV-1 and 55% of CVBR-inf ected FF12 cells 
histochemically positive. 
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Luciferase activity in MS-Luc cells infected in 
parallel with the same receptor viruses confirmed that CVBR 
was not the most potent transact ivator in the CV-1 and FF12 
cell types and that all receptor types were functional in 
5 293 cells. The MS-Z data suggest that the low absolute 
level of CVDEiR induction in CV-1 and FF12 cells with murA 
most likely reflects a small number of robustly responding 
cells in a largely nonresponsive population as opposed to 
low level induction by most MS-luc-inf ected cells. 

10 Example 18 

Transient transf ections 

Transient transf ections are performed by calcium- 
phosphate coprecipitation employing standard methods (see 
Sambrook et al., in Molecular Cloning: A Laboratory Manual. 

15 Cold Spring Harbor Press, New York, New York (1989)) . All 
tissue culture experiments are performed using DMEM 10% FBS 
in a 10% C0 2 incubator unless otherwise specified. All 
transf ections are performed in triplicate in 24 -well Costar 
plates using CV-1 cells at an approximately 5X10 4 plating 

20 density. Immediately following transf ection, 1 /xm ligand 
in 20% EtOH/PBS is added to wells. After 40 additional 
hours of incubation, cells are harvested and luciferase 
activity measured in an analytical bioluminescence 
photometer. Cell extracts are simultaneously examined for 

25 LacZ activity of the internal control by standard methods. 
Bar graph levels represent relative luciferase activity 
after correction using internal control values. All 
molecular biology enzymes and reagents used in this study 
are provided by either NEB (Beverly, MA) or Strategene (La 

3 0 Jolla, CA) . 
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Example 19 
Prolonged transgene expression for 
ex vivo gene therapy applications 

Autologous, explanted skin fibroblasts genetically 
5 modified to express tyrosine hydroxylase (TH) , the enzyme 
responsible for synthesis of L-dopa and the precursor for 
the neurotransmitter dopamine, have proven to ameliorate a 
loss of local dopamine in animal models of Parkinson's 
disease. Although quite effective in providing dopamine to 

10 the area of neural damage, this technique is only 
therapeutically useful for 2-3 weeks following 
transplantation. The decrease in effectiveness has been 
traced back to a dramatic loss of TH transgene expression 
in transplanted, post-mitotic cells. The use of invention 

15 constructs allows one to overcome the loss of transgene 
expression by providing stimulation of the retroviral LTR 
promoter through either ligand-activated transactivating 
complexes or through constitutive transactivating receptor 
variants. In this way, transgene expression may be 

20 maintained for longer periods of time, even indefinitely if 
desired. 

Stem- type cells, such as stem cells of the 
hematopoetic system, nervous system, or embryo, could be 
infected or transfected with VbR regulated transgenes and 

25 subsequently implanted into adult, fetus, or early 
embryonic animals for either therapeutic or research 
purposes. Cells of the hematopoetic system could 
conditionally express proteins producing blood clotting 
factors such as factor IX, metabolic factors such as 

3 0 glucocerebrosidase, or protective factors including 
anti-HIV proteins. 
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Example 2 0 
Prolonged transaene expression for 
in vivo crene therap y applications 

The small size and regulatory capacity of VbRs lend 
5 themselves to use in recombinant retroviruses as a method 
of gene transfer. VbRs have been introduced into both MLV 
and lent iviral -based retroviral systems. When these 
viruses are introduced directly into target cells of either 
a mature or developing organism, expression of the virally 
10 encoded transgenes may be regulated by systemic addition of 
ligands such as tebufenozide and derivatives. An example 
of a disease that could be theoretically ameliorated by 
application of in vivo VbR encoding retroviruses is 
Parkinson's disease, described above. 



15 Another application would be to use VbR encoding 

retroviruses as an anti-viral agent. Lentiviral vectors 
with regulated properties and harboring suicide genes or 
"protective" proteins could be used as a means of 
conditionally depleting or destroying HIV-positive cells. 

20 Example 21 

Modulated transaene expression for either 
in vivo or ex vivo gene therapy application 

Treatment of Parkinson's disease with the chemical 
precursor of dopamine, L-dopa, has proven effective in 

25 ameliorating many of the deficits of Parkinsonism. With 
time, however, patients become refractory to L-dopa 
therapy, with the deleterious effects of chronic treatment 
outweighing even the serious symptoms of the disease 
itself. Eventually, patients are left with few therapeutic 

3 0 options. While the transplantation of TH expressing cells 
may be effective when constantly producing low-levels of L- 
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dopa, a potentially far more beneficial approach would be 
to allow the physician some degree of control over L-dopa 
production in the patient. This would allow sufficient 
control to ensure that the transgenic factor is expressed 
5 at appropriate therapeutic levels. At times when 
endogenous systems are capable of providing full function, 
the transgene may be allowed to become quiescent and 
transcriptionally inactive until needed again. Because the 
transcriptional induction of the invention retroviral 
10 constructs is dependent on an exogenous ligand, expression 
of an integrated therapeutic transgene can be placed under 
the control of the physician and patient. 

Example 22 

Construction of a BORIS vector expressing PolvO-GFP 

15 PolyQ-GFP are constructed from an existing GFP clone, 

GFP-SKSP. PCR primers encoding a novel restriction site 
and ATG start codon at the 5' end, and an Age I site 
in- frame with the GFP ORF at the 3' end are obtained for 
PCR amplification of huntingtin polyQ regions. These PCR 

20 products are inserted in-frame into GFP-SKSP to produce the 
fusion constructs of pQ-GFP. The clones are analyzed for 
mutations and function by sequencing and in vitro 
translation to confirm the creation of a pQ-GFP ORF. 
Alternate antibiotic resistance BORIS regulatable 

25 retroviral vectors are produced by removing the 
L-Histidinol gene from plasmid LSHL, by PCR, and inserting 
it in place on the neo r into a BORIS vector for production 
of BORIS-LHis; The plasmid are large-scale prepped. A 
preliminary test for function is transient transfection 

30 experiments into transf ectable cell types such as 293 cells 
and examination for the expression of beta-galactosidase or 
GFP reporter genes. Retrovirus are transiently produced 
and used for infection of target cells. Infected cells are 
selected in L-Histidinol containing media to ensure proper 
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selection. polyQ-GFP is then inserted into regulated 
retroviral vectors. pQ-GFPs are excised from SKSP and 
inserted into the regulatable BORIS and BORIS-LHis vectors. 
The plasmids are large-scale prepped. Function is 
5 determined by examination for green fluorescence and 
altered phenotype. Additional plasmid are used for 
transient production of retroviruses. 

Primary rat fibroblasts, and the hcnlv-myc are selected for 
initial testing of BPQGS. Both are infected with the 

10 appropriate vector (s) place under selection, and individual 
resulting colonies subcloned. The subclones are expanded 
and passaged for analysis. Infected cells are treated or 
not with their respective ligands and the resulting 
phenotypes analyzed. Preliminary analysis will examine GFP 

15 fluorescence and distribution within the cells. At timed 
intervals following ligand induced expression, the pattern 
of fluorescence are recorded to determine if the fusion 
protein changes in distribution. Morphological 
characteristics of stimulated cells their growth rates, and 

20 cell death will also be examined at intervals following 
induction. For the hcn/v-myc cells, the consequences of 
polyQ-GFP expression in the presence and absence of 
tetracycline are compared. 

Example 23 

25 Aggregation of polvO- fusion proteins 

GFP fusion proteins with long (97Q) , short (13Q) , or 
no N- terminal polyQ tracts were constructed. Unique 
restriction endonuclease sites were designed into the 
regions flanking the polyQ tracts to allow insertion of an 
30 SV40 nuclear localization signal (NLS) or other sequences 
either C-terminal to the polyQ tract. Transient 
transfection analysis of these constructs in the NIT 
expression vector revealed that cells transduced with GFP 
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only or 13Q-GFP expressed GFP uniformly throughout their 
cytoplasm and that the NLS variant 13QN-GFP localized 
predominantly in the nucleus with a uniform distribution. 
97Q- and 97QN-GFP constructs, on the other hand, initially 
5 manifested as uniform GFP positive fluorescence throughout 
the cytoplasm or nucleus, but within 24-72 hours after 
transfection condensed into bright granules indicative of 
intracellular aggregates (IAs) . IAs were localized within 
the cytoplasm, surrounding the nuclear envelope, and within 
10 the nucleus itself (Figure IB) . The presence of the SV40 
nuclear localization signal (NLS) , though observed to 
efficiently direct localization of either 13Q- or 97Q-GFP 
variants to the nucleus, resulted in a quantifiable 
difference in the rate of IA formation after transfection. 

15 Example 24 

GFP-fusion proteins with short polyglutamine tracts are 

recruited into IAs 



It was sought to determine whether overexpression of 
13Q-GFP constructs would inhibit or interfere with the 

20 formation of IAs from 97Q-fusion constructs . 13Q-GFP was 
not observed to exert any inhibitory influence on 97Q-GFP 
mediated IA formation, and contrary to expectations, 
appeared to result in a qualitative difference in the IAs 
that formed. To test whether the 13Q-GFP fusion proteins 

2 5 were participating in the formation of IAs, the 
GFP-encoding portion of 97Q-GFP (97QA) was deleted and this 
construct was cotransf ected with 13Q-GFP into cultured 
293 cells. Fluorescent IAs were readily observed in a 
large population of the GFP-positive cells within the 24-72 

30 hr. time course observed for 97Q-GFP alone. 13Q-GFP/97QA 
doubly infected cells were distinctive from 97Q-GFP cells 
because they continued to display faint uniform cytoplasmic 
GFP positivity in addition to the formation of 
predominantly nuclear IAs, whereas IA formation within 
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97Q-GFP cells generally results in condensation of all GFP 
fluorescence into IAs leaving other areas of the cell 
nonf luorescent and dark. Control cotransf ection of 
NIT-97QA with NIT-GFP constructs with no N- terminal 
5 polyglutamine tracts did not result in the formation of IAs 
under any conditions tested, indicating that the short 
polyQ tract of 13Q-GFP is necessary for participation in IA 
formation. 

Preliminary studies of 13Q-GFP recruitment were 
10 performed with a molar excess of 97QA plasmid. To further 
determine the molar ratio of 97Q to 13Q required for 
participation of 13Q in IA formation, 13Q-GFP was titrated 
with 97QA in a cotransf ection assay and the percent of IA 
containing cells quantified relative to the total number of 
15 visibly GFP positive cells. No IAs were observed with non 
cotransf ected 97QA or at a 0 . 1 molar ratio of 97QA plasmid; 
however, at a 0.5 molar ratio of 97QA to 13Q-GFP, 10% of 
GFP positive cells contained GFP positive IAs. At a 1:1 
ratio, nearly 40% of GFP positive cells also displayed IAs. 
20 At 10:1 ratio of NIT-97QA to NIT-13Q-GFP, over 75% of 
positive cells displayed IAs 60 hours post- transf ection . 

Example 2 5 

Characterization of IAs formed from a combination of long 
and short polvQ- reporters 

25 To better characterize the formation of IAs in the 

presence of both long and short polyQ tracts, 97QA was 
modified to contain an in- frame fusion of the LacZ gene 
encoding p-galactosidase (97QZ) . Preliminary transient 
transf ection studies of 97QZ revealed that like 97Q-GFP, 

30 97QZ condensed within transfected cells to form both 
immunopositive and histochemically reactive IAs. NIT-97QZ 
and NIT-13Q-GFP were cotransf ected into cells and 6 0 hours 
later, performed immunohistochemistry and histochemical 
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reactions on the transfected cells to simultaneously 
examine both reporter constructs. Although the 

3-galactosidase (3-gal) histochemical reaction obscures GFP 
fluorescence, through slight modification of standard 
5 staining techniques and limiting the time of exposure to 
reagents, a low level of GFP fluorescence could be 
preserved and imaged along with the characteristic blue 
stain of the 3-galactosidase reaction. A population of IAs 
are observed that are both GFP and 3-galactosidase 
10 positive. Numerous large 3-gal only positive IAs are also 
present, presumably from the aggregation of 97QZ only (or 
predominantly) . Curiously, several IAs that appear only 
positive for GFP were also observed using the histochemical 
stain and native GFP fluorescence imaging. 

15 Further exploration of this observation was performed 

using confocal microscopy and immunohistochemistry to 
identify reporters within IAs. As in the above study, 
individual IAs were clearly observed to be positive for 
both reporters while other IAs were positive only for 3-gal 

20 for GFP. Taken together, these results clearly indicate 
that long polyglutamine tract -reporter fusion constructs 
form IAs in vitro, and that the forming aggregates can 
either recruit short polyglutamine-reporters into the 
aggregate, induce them to aggregate on their own, or both. 

25 Example 2 6 

Construction of cel l lines with regulated IA formation 
and 970-GFP expression 



The study of IA formation has thus far relied heavily 
on transient transfection analysis and has been hampered by 
30 a lack of stable IA-forming cultured cell populations. 
Through the use of retroviral vectors, a variety of stably 
cultured cell lines conditionally or chronically expressing 
the polyQ-GFP constructs were constructed. Two regulatable 
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vector systems, an induced expression system based on a 
modified ecdysone receptor variant (VbR) and a repressed 
expression system based on the tetracycline transactivator 
(TTA) , were used to create cell lines with inducible IA 
5 formation. These two vectors are best suited to different 
ranges of induction in cultured cell types. VBR-based B3 
vectors are particularly suited to applications where very 
low basal expression levels that can be rapidly stimulated 
to moderately high levels are desired. The TTA-based NIT 

10 vector tends not have the broad-based low level of 
"uninduced" expression characteristic of the BORIS vectors 
(identified in SEQ id NO:6); however, because 
transactivator expression and transactivation is continuous 
in the absence of ligand, very high transgene expression 

15 levels may be achieved through the use of this vector. 

97Q-GFP was cloned via a custom shuttle vector into 
both retroviral vectors. Transiently produced B3-polyQ-GFP 
retroviruses (B3-13Q or B3-97Q) and TTA-based NIT polyQ-GFP 
retroviruses (NIT-13Q or NIT-97Q) were used at low MO I to 

20 infect several well-characterized cell lines used 
previously in transduction studies of IA formation. COS -7 
cells were selected because several studies had examined IA 
formation and IA- induced apoptosis in this cell type. 
293 cells were utilized for the same reason and also 

25 because they are amenable to efficient transfection and 
would have significant utility in the development of 
gene-based factors that interact with or interrupt IA 
formation. Primary rat fibroblasts were selected because 
they are non-transformed and readily enter a prolonged 

3 0 post -mitotic state following contact inhibition at high 
culture densities. 

Pilot studies with reporter constructs revealed that 
the B3 vector functioned most efficiently in the 293 and 
COS cell lines and that NIT functioned at highest 
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efficiency in the primary fibroblast line. Individual cell 
lines infected with the appropriate vector were placed 
under G418 selection to remove uninfected cells. For 
B3-polyQ infected 293 and COS-7 cells, the resulting 
5 resistant population was pooled and passaged without 
ligand. For NIT-polyQ infected primary fibroblasts, cells 
were cultured with the tetracycline analog doxycycline 
prior to, during, and after infection to prevent chronic 
polyQ overexpression to disrupt growth of resistant 
10 colonies. 293 and COS-7 cells were also infected with NIT 
constructs and cultured without added doxycycline to 
determine the effects of chronic overexpression of 97Q-GFP 
on these cell types as described below. 

Example 2 7 

15 Stable IA-forming 293 cells 

G418-resistant B3-97Q 293s displayed little to no 
visible GFP fluorescence during culture in the absence of 
ligand; however, introduction of 1 /xM tebufenozide into the 
culture medium resulted in a rapid increase of cytoplasmic 

20 fluorescence within 24 hours post - induction . Nearly 100% 
of cells would display GFP-positive fluoresence within 
24-72 hours of induction. After 24 hours, only widely 
scattered cells would contain small, detectable IAs . By 
48 hours, >8% of the cell population would contain one or 

25 more IAs increasing to >15% of the population between 48 
and 72 hours. Occasional IA containing cells would be 
observed floating in the culture medium; however, there was 
no evident "wave" of cell death at the 72 hour time point 
or later times. Passage of B3-97Q 293s in the presence of 

3 0 ligand for three passages resulted in only a small but 
detectable decrease in the percentage of cells containing 
IAs (15% to 11%) . 
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NIT-97Q infection sheds further light on the toxic 
potential of long-polyglutamine repeats on 293 cells. 
Surprisingly, colonies of 100% IA-positive 293 cells 
survived for over two weeks and apparently divided and 
5 grew, in some cases, to colonies of hundreds of cells 
(Figure 4D) . By the end of 21 days of culture; however, 
the majority of IA positive colonies had disappeared 
leaving only scattered IA positive cells and numerous 
colonies of GFP positive cells without IAs that eventually 
10 grew to fill the plates. The expression time-course and 
level of B3- AND NIT-13Q cells paralleled the 97Q cells 
although no IAs were formed. 

Example 2fi 
Stable IA-formina COS-7 cells 

15 G418-resistant B3-97Q C0S-7s displayed no visible GFP 

fluoresence during culture in the absence of ligand in a 
manner almost identical to the 293 B3-97Q population. 
Addition of 1 \M tebufenozide into the culture medium 
resulted in a rapid increase of cytoplasmic fluorescence 

20 within 24-48 hours post -induction. IA formation peaked at 
4-5% of total cells (at any given moment) 72-96 hours 
post- induction. Unlike the B3-97Q 293 cells, a continuous 
population of G418 -resistant cells did not display even GFP 
fluoresence suggesting that one or more components of the 

25 regulatable system were lost from the COS population. Like 
B3-97Q 293 cells, IA containing COS-7 cells were also 
observed floating in the culture medium; however a 
depletion of the expressing population was not observed 
presumably because the continuous formation of IAs in other 

30 cells. 

The effects of chronic 97Q-GFP overexpression in 
infected COS-7 cells was examined using the NIT vector. 
Like NIT-97Q infected 293s, colonies in which the majority 
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of cells contained large and numerous IAs were readily- 
observed. Unlike the overexpressing 293 cells; however, 
several selected colonies continued to grow and express IAs 
in a continual percentage of the population even after 

5 passage. Three lines, COSN97Q-4, 6 and 7, continually 
express IAs after continuous growth for over four weeks. 
COSN97Q-7 displays the highest level of spontaneous IA 
formation. At any given time, approximately 6% of the 
total population contains readily observable IAs. CV-1 

0 cells, a predecessor of the COS-7 cells were also infected 
with NIT-97Q and formed IA containing colonies in much the 
same manner as COS cell populations. 



Example 2 9 
Stable IA formin g primary f ibrobl asfR 

15 B3-97Q infected primary fibroblasts (FF12s) became 

faintly GFP fluorescent within 72 hours of tebufenozide 
treatment but did not achieve a maximum level of 97Q-GFP 
expression sufficient to result in IA formation in a 
significant population of cells. The NIT vectors were 

20 employed to generate regulated 97Q-GFP expression. 
Infected NIT-97Q FF12s were infected and cultured 
continuously in the presence of doxycycline (dox) to 
prevent IA formation. Twelve individual colonies of 
NIT-97Q FF12s were selected in order to screen for 

25 individual cell populations with low dox inhibited 
expression and high 97Q expression in the absence of 
ligand. One line, FFN97Q-5 displayed GFP fluoresence 
within 90% of the confluent cell population within 5 days 
of ligand removal. IAs began forming on day 5 and 

30 continued to increase through day 8. Approximately 15% of 
the total cell population displayed IAs 10 days post ligand 
removal. IA formation in the confluent FFN97Q-5 population 
was distinctive from other cell types examined in that it 
was occasionally exclusively nuclear and at other times 
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predominantly cytoplasmic in an unusual pattern within the 
cell body. IAs with a distinctive stellate or tri -partite 
appearance were often observed. Large cytoplasmic 
aggregates of multiple star-like IAs were also occasionally 
5 seen. Curiously, confluent FFN97Q-5 cells with multiple 
cytoplasmic or nuclear IAs appeared visually normal. 
Floating dead cells with or without IAs were rarely 
observed and in numbers no different from control wells. 
FF970Q-%s cultured in the presence of dox neither expressed 
10 visible 97Q-GFP nor formed IAs. 

Example 30 
Use of invention for efficient 
production of transgenic animals 

Transgenic animals are generally produced by either 

15 pronuclear injection of DNA or by transfection of embryonic 
stem (ES) cells followed by selection and injection of the 
stem cell into the inner cell mass of very early embryos. 
Pronuclear injection results in approximately 5-10% stable 
gene transfer in the production of transgenic mice. The 

2 0 use of ES cells in producing transgenics is likewise 
inefficient in generating mosaics with germ-line 
transmission of the transgene. It was proposed in the mid- 
1980' s to use retroviruses to transfer transgenes with high 
efficiency into early embryos or ES cells to dramatically 

25 enhance the odds of producing transgenic animals. All 
attempts at this failed, not because the virus was 
incapable of stably integrating into the target cell 
genome, but because the integrated provirus did not express 
any of the genes encoded within the viral transcriptional 

30 cassette. 

The present invention is capable of overcoming the 
transcriptional block to result in germ-line transgenic 
animals with full expression from the integrated transgene. 
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In addition to expressing the transgene, the level of 
transcription may still be regulated by controlling the 
supply of ligand to the transgenic animal. The increased 
efficiency of producing transgenic animals by retroviral 
5 infection should open up the way to producing mutant 
animals of a variety of species previously impractical for 
genetic modification because of the potential cost of 
producing a large number of non-positive animals by 
classical methods. 

10 To produce transgenic mice, the following nucleic acid 

constructs are prepared and subsequently injected into 
fertilized eggs: CD3-VbR and a ligand inducible S-gal 
reporter. Two separate lines of transgenic mice are 
generated harboring either a ligand inducible reporter, or 

15 a T-cell specific expression construct of VbR, 
respectively. The former are referred to as reporter mice, 
the latter are referred to as receptor mice, and double 
transgenic mice are referred to as receptor/reporter mice. 
Constructs CD3-VbR are injected, while the reporter is 

2 0 injected alone. Primary genotyping is performed by 

Southern blot analysis and the transmission of transgenic 
mice is monitored by dot blot analysis. Receptor mice are 
analyzed for VbR expression by Northern blot analysis of 
RNA collected from these mice. For Northern blot analysis, 
25 15^g of total RNA obtained from the thymus, and various 
tissues as a control, is run on a denaturing gel and 
blotted onto a nitrocellulose membrane. The blot is probed 
with a radiolabeled &-gal-specif ic probe and exposed on 
film for 2 days. In addition, the transgene can be 

3 0 transferred to the offspring as expected by Mendelian 

genetics . 
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Example 31 

Use of retroviral constructs in the invention for 
efficient gene transfer to developing embryos 

Since the present invention can effectively overcome 
5 the block of viral expression in embryonic cells, invention 
constructs are a potent tool in the delivery of transgenes 
to somatic tissues of a developing embryo. With many 
diseases, considerable damage is done during embryonic 
development so that therapies applied after birth are 
10 essentially ineffective to ameliorate the disease 
phenotype . 

The present invention provides methodology where one 
can infect cells of the embryo and can provide therapeutic 
factors to the developing fetus either constitutively, or 
15 under the regulation of exogenously produced ligand. 

Example 32 
Use of vector constructs in invention 
with inducible high titers 

One obstacle to the use of retroviruses as gene 

2 0 transfer agents is that titers of retroviruses from 

existing producer cell lines are only on the order of 1X10 4 
or 1X10 5 . By using a retroviral construct of the invention 
having intact enhancers and regulatory elements, expression 
of the retrovirus may be induced by greater than ten- fold, 
25 resulting in correspondingly higher titers of infectious 
virus . 

While the invention has been described in detail with 
reference to certain preferred embodiments thereof, it will 
be understood that modifications and variations are within 

3 0 the spirit and scope of that which is described and 

claimed. 
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That which is claimed is: 

1. A nucleic acid construct encoding a receptor 
peptide comprising : 

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
superf amily 

(ii) an activation domain, and 

(iii) a DNA binding domain; 

wherein said receptor peptide activates regulatory 
element (s) in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

2. A nucleic acid construct according to claim 1 
wherein said ligand for said ligand binding domain is a 
non-steroidal compound not normally present in a 
mammalian cell. 

3. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from a lepidopteran species . 

4. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from bombyx mori. 

5 . A nucleic acid construct according to claim 1 
wherein said ligand binding domain has substantial 
sequence identity with amino acid sequence 3 93-508, as 
set forth in SEQ ID NO: 3. 

6 . A nucleic acid construct according to claim 1 
wherein said ligand binding domain has substantial 
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sequence identity with amino acid sequence 393-586, as 
set forth in SEQ ID NO: 3. 

7. A nucleic acid construct according to claim 1 
wherein said hinge region has substantial sequence 
identity with amino acid sequence 273-362, as set forth 
in SEQ ID N0:3. 

8 . A nucleic acid construct according to claim 1 
wherein said hinge region has substantial sequence 
identity with amino acid sequence 273-362, as set forth 
in SEQ ID NO:3. 

9. A nucleic acid construct according to claim 1 
wherein said DNA binding domain is obtained from a DNA- 
binding protein. 

10. A nucleic acid construct according to claim 9 
wherein said DNA binding domain is obtained from a member 
of the nuclear receptor superfamily. 

11. A nucleic acid construct according to claim 10 
wherein said DNA binding domain is non- mammalian. 

12. A nucleic acid construct according to claim 1 
wherein said receptor peptide has substantial sequence 
identity with the amino acid sequence set forth in SEQ ID 
NO 5. 

13. A nucleic acid construct according to claim 1, 
wherein said nucleic acid construct further comprises a 
regulatory element operatively associated with exogenous 
nucleic acid(s) . 
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14 . A nucleic acid construct according to claim 13 
wherein said regulatory element comprises a hormone 
response element . 

15. A nucleic acid construct according to claim 14 
wherein said construct comprises a plurality of hormone 
response elements. 

16. A nucleic acid construct according to claim 13 
wherein said exogenous nucleic acid(s) is (are) selected 
from wild type and therapeutic nucleic acid(s) . 

17. A nucleic acid construct according to claim 16 
wherein said wild type nucleic acid(s) is (are) selected 
from genes which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in said host; or 

a substantial excess of which leads to the 
occurrence of a non-normal state in said host. 

18. A nucleic acid construct according to claim 17, 
wherein said exogenous nucleic acid is a CAG repeat 
encoding a polyglutamine polypeptide. 

19. A nucleic acid construct according to claim 16 
wherein said therapeutic nucleic acid(s) is (are) 
selected from those which encode products: 

which are toxic to the cells in which they are 
expressed; or 

which impart a beneficial property to said 

host . 

20. A nucleic acid construct according to claim 19 
wherein said therapeutic nucleic acid(s) modulates the 
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transcription and/or translation of endogenous nucleic 
acid(s) . 

21. A nucleic acid construct according to claim 1, 
further comprising nucleic acid encoding a selectable 
marker . 

22 . A nucleic acid construct according to claim 21 
wherein said selectable marker is a radiolabeled 
molecule, a fluorescent molecule, an enzyme, or a ligand. 

23 . A nucleic acid construct according to claim 22 
wherein said enzyme is an antibiotic resistance gene or a 
gene which enables cells to process a metabolic 
intermediary. 

24. An expression cassette comprising a promoter 
operatively associated with a nucleic acid construct 
according to claim 1. 

25. An expression cassette according to claim 24, 
wherein said promoter is selected from inducible, 
constitutive and/or tissue specific promoters. 

26. A viral vector capable of ligand-mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 24. 

27. A viral vector according to claim 26, wherein 
said viral vector further comprises a retroviral psi M 
packaging signal and a 5' and/or a 3- long terminal 
repeat (LTR) . 
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28. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 1. 

29. A cell according to claim 28, wherein said cell 
is further transformed with a reporter vector comprising 
a regulatory element operatively associated with 
exogenous nucleic acid(s) . 

30. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 13. 

31. An animal cell according to claim 30, wherein 
said cell is an embryonic stem cell. 

32. A transgenic animal, wherein cells of said 
animal is transfected with a nucleic acid construct 
according to claim 1. 

33. A transgenic animal, wherein cells of said 
animal is transfected with a nucleic acid construct 
according to claim 13 . 

34. A polypeptide receptor encoded by a nucleic 
acid construct according to claim 1. 

35. A nucleic acid construct encoding a receptor 

peptide comprising : 

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
super family, and 

(ii) a DNA binding domain, wherein said DNA binding 
domain is not derived from said non-mammalian 
member; 

wherein said receptor peptide activates regulatory 
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element (s) in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

36. A nucleic acid construct according to claim 35, 
wherein said receptor peptide further comprises: 

(iii) an activation domain. 

37. A nucleic acid construct according to claim 35, 
wherein said nucleic acid construct further comprises a 
regulatory element operatively associated with exogenous 
nucleic acid(s) . 

38. An expression cassette comprising a promoter 
operatively associated with a nucleic acid construct 
according to claim 35. 

39. A viral vector capable of ligand-mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 38. 

40. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 35. 

41. A cell according to claim 40, wherein said cell 
is further transformed with a regulatory element 
operatively associated with exogenous nucleic acid(s) . 

42. A polypeptide receptor comprising: 
a VP16 activation domain, 

a DNA binding domain, and 

bombyx mori-derived ligand binding domain and hinge 
region . 
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43. A nucleic acid construct comprising a 
regulatory element derived from a bombyx receptor 
response element operatively associated with exogenous 
nucleic acid (s) . 



44 . An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 43 . 



45. An animal cell according to claim 44, wherein a 
polypeptide receptor is further introduced into said 
cell, said receptor comprising: 

an activation domain, 
a DNA binding domain, 

and the ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
superf amily , 

wherein said receptor peptide activates regulatory 
element in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

46. A transgenic nonhuman mammal, wherein cells of 
said mammal are transfected with the nucleic acid 
construct according to claim 43 . 

47. A method for modulating the transcription of 
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; and 

(ii) a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
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region of a non- mammalian member of the nuclear 
receptor superfamily which is not normally 
present in the cells of said host, wherein said 
receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain, 

said method comprising administering to said host an 
amount of ligand effective to modulate the transcription 
of said exogenous nucleic acid(s) ; wherein ligand is not 
normally present in the cells of said host. 

48. A method according to claim 47 wherein said 
receptor peptide is provided by a nucleic acid construct 
encoding said receptor peptide. 

49. A method according to claim 48 wherein said 
receptor peptide is expressed under the control of a 
tissue specific promoter. 

50. A method according to claim 47 wherein said 
exogenous nucleic acid(s) is (are) selected from wild 
type and therapeutic nucleic acid(s) . 

51. A method according to claim 50 wherein said 
wild type nucleic acid(s) is (are) selected from genes 
which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in said host; or 

a substantial excess of which leads to the 
occurrence of a non-normal state in said host. 
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52. A method according to claim 50 wherein said 
therapeutic nucleic acid(s) is (are) selected from those 
which encode products: 

which are toxic to the cells in which they are 
expressed; or 

which impart a beneficial property to said 

host . 

53. A method according to claim 50 wherein said 
therapeutic nucleic acid(s) modulates the transcription 
and/or translation of an endogenous nucleic acid(s). 

54. A method according to claim 47 wherein said 
regulatory element is not normally present in the cells 
of said host. 

55. A method according to claim 47 wherein said DNA 
binding domain is obtained from a DNA-binding protein. 

56. A method according to claim 55 wherein said DNA 
binding domain is obtained from a member of the nuclear 
receptor superf amily . 

57. A method according to claim 55 wherein said DNA 
binding domain is a GAL4 DNA binding domain. 

58. A method according to claim 47 wherein said 
receptor peptide further comprises an activation domain. 

59. A method according to claim 47 wherein said 
host further contains a non-mammalian transactivator not 
normally present in the cells of said host and a 
transactivator responsive regulatory element not normally 
present in cells of said host, 
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wherein said transactivator responsive 
regulatory element controls transcription of said 
exogenous nucleic acid(s) or a second nucleic acid 
construct comprising a second exogenous nucleic 
acid (s) . 

60. A method according to claim 59 wherein said 
exogenous nucleic acid{s) is (are) under the further 
control of said transactivator responsive regulatory 
element . 

61. A method according to claim 59 wherein said 
transactivator responsive regulatory element controls 
transcription of a second DNA construct comprising a 
second exogenous nucleic acid(s) . 

62. A method according to claim 59 wherein said 
transactivator responsive regulatory element comprises an 
operator which is responsive to said non-mammalian 
transactivator which confers responsiveness to 
antibiotics . 

63 . A method according to claim 59 wherein said 
transactivator is a ligand-mediated receptor which 
confers responsiveness to antibiotics. 

64. A method according to claim 47 wherein said 
hinge region has substantial sequence identity with amino 
acid sequence 273-362, as set forth in SEQ ID NO:3. 

65. A method according to claim 47 wherein said 
ligand binding domain has substantial sequence identity 
with amino acid sequence 393-508, as set forth in SEQ ID 
NO: 3 . 
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66. A method according to claim 47 wherein said 
ligand binding domain has substantial sequence identity 
with amino acid sequence 3 93-586, as set forth in SEQ ID 
NO : 3 . 

67. A method according to claim 47 wherein said 
receptor peptide has substantial sequence identity with 
the sequence set forth in SEQ ID NO: 3, 

68. A method according to claim 47 wherein said 
receptor peptide has the sequence set forth in SEQ ID 
NO: 3 . 

69. A method according to claim 47 wherein said 
ligand for said ligand binding domain is a non-steroidal 
compound. 

70. A method according to claim 69 wherein said 
non-steroidal compound is a hydrazine. 

71. A method according to claim 70 wherein said 
hydrazine is a 1,2-diacyl hydrazine, an N' -substituted- 
N,N ! -disubstituted hydrazine, a dibenzoylalkyl 
cyanohydrazine, an N-substituted-N-alkyl-N,N-diaroyl 
hydrazine, an N-substituted-N-acyl-N-alkyl , carbonyl 
hydrazine, and an N-aroyl-N 1 -alkyl-N 1 -aroyl hydrazine. 

72. A method according to claim 71 wherein said 
hydrazine is a diacyl hydrazine. 

73 . A method of inducing the transcription of an 
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
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and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 

comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain, and 



(iii) said ligand for said ligand binding 

domain, wherein said ligand is not 
normally present in the cells of said 
host ; 



said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide. 

74 . A method of inducing the transcription of an 
exogenous gene in a host containing a nucleic acid 
construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a regulatory 
element; said method comprising introducing into a host: 

a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear 
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receptor superfamily which is not normally present 
in the cells of said host, wherein said receptor 
peptide activates said regulatory element in the 
absence of an exogenous dimer partner therefor and 
in the presence of a ligand for said ligand binding 
domain, and 

a ligand for said ligand binding domain. 

75. A method for the treatment of a host in need of 
gene therapy, said method comprising: 

introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain; and 

administering, to said host, ligand for said ligand 
binding domain. 

76. A method for the treatment of a host in need of 
gene therapy, said method comprising: 
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introducing into cells obtained from said host: 

(i) a nucleic acid construct comprising a promoter 

and exogenous nucleic acid(s) under the control 

of a regulatory element; and 



(ii) nucleic acid endocing a receptor peptide 

comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain, 

to provide modified cells; 

reintroducing the modified cells into said host, and 
administering, to said host, ligand for said ligand 
binding domain. 



77. A method for producing transgenic animals 
capable of prolonged and regulated transcription of 
exogenous nucleic acid(s) , said method comprising 
introducing into early-stage embryos or stem cells: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
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superfamily which is not normally present in 
the cells of said transgenic animal, wherein 
said receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain. 

78. A method for the expression of recombinant 
products detrimental to a host organism, said method 
comprising: 

transforming suitable host cells with: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) which express 
said recombinant product under the control of a 
regulatory element; wherein said regulatory 
element is not normally present in the cells of 
said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner for 
said ligand binding domain, and 

growing said host cells to the desired level in the 
substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
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introducing into said host cells said ligand, which, in 
combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

79. A method according to claim 78, wherein said 
exogenous nucleic acid is a CAG repeat which encodes a 
polyglutamine polypeptide. 

80. A method for identifying compounds useful for 
treating neuodegenerative conditions, said method 
comprising: 

introducing a test compound into a host cell 
transformed according to claim 79, and 

identifying those test compounds that target 
intracellular aggregates of polyglutamines as compounds 
useful for treating neuodegenerative conditions. 

81. A method for modulating the transcription of 
nucleic acid(s) in an in vitro system, said method 
comprising administering to said system an amount of said 
ligand effective to modulate the transcription of said 
nucleic acid(s); wherein said ligand is not normally 
present in said cellular system; wherein said system 
comprises : 

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 
regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
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the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain. 

82. A nucleic acid construct comprising nucleic 
acid encoding a VP16 activation domain, a DNA binding 
domain and bowbyx mori- derived ligand binding domain and 
hinge region. 

83 . A nucleic acid construct comprising a promoter 
and nucleic acid encoding a tetracycline-controlled 
transact ivator, a VP16 activation domain, a DNA binding 
domain and bowbyx mori- derived ligand binding domain and 
hinge region, in operative association with one another. 

84. A vector comprising: 

a first reporter under the control of a regulatory 
element, 

a second reporter under the control of an operator 
which is responsive to a ligand-mediated receptor which 
confers responsiveness to antibiotics, and 

a nucleic acid construct comprising a promoter 
operatively associated with nucleic acid encoding a 
tetracycline-controlled transact ivator , a VP16 activation 
domain, a DNA binding domain and Jbombyx mori -derived 
ligand binding domain. 
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B3 - V polyQ - GFP constructs to date 
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SEQUENCE LISTING 
<110> The Salk Institute for Biological Studies 

<120> Modified Lepidopteran Receptors and 

Hybrid Mult i -Functional Proteins for Use in Transcription 
and Transgene Expression Regulation 



<130> SALK2070WO 

<140> PCT/US98/14215 
<141> 1998-07-10 

<150> 60/091874 
<151> 1998-07-07 

<150> 08/891,298 
<151> 1997-07-10 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 71 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> DNA binding domain of member of steroid/ thyroid 
hormone receptor superfamily 





<400> 1 


























Cys 


Xaa Xaa Cys 


Xaa 


Xaa 


Asp 


Xaa Ala Xaa Gly Xaa Tyr Xaa 


Xaa 


Xaa 


1 




5 










10 










15 




Xaa 


Cys Xaa Xaa 


Cys 


Lys 


Xaa 


Phe 


Phe 


Xaa 


Arg 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 




20 










25 










30 






Xaa 


Xaa Xaa Xaa 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Cys 




35 








40 










45 








Xaa Xaa Xaa Lys 


Xaa 


Xaa 


Arg 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Cys 


Arg 


Xaa 


Xaa 




50 






55 










60 










Lys 


Cys Xaa Xaa 


Xaa 


Gly 


Met 




















65 






70 























<210> 2 
<211> 2711 
<212> DNA 
<213> Bombyx mori 

<220> 
<221> CDS 

<222> (185) . . . (2002) 
<400> 2 

ctccgttgac gacggtcgcg cgtgcggtac gtccgttttt acggctcaaa cgtacacggt 60 
aacctccgtc tctgcatcat cggcggaact cgtgaaattc gcgtgctttt ctcacctgtt 120 
gaacgagttg tgttgtgact gaaaaacatc atcacaaata tcaagcttca aaactaataa 180 
gtga atg aga gtc gag aac gtg gat aac gta teg ttt get ttg aac gga 229 
Met Arg Val Glu Asn Val Asp Asn Val Ser Phe Ala Leu Asn Gly 
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15 10 15 

cgc get gac gag tgg tgt atg tct gta gag acg cgt tta gat agt tta 277 
Arg Ala Asp Glu Trp Cys Met Ser Val Glu Thr Arg Leu Asp Ser Leu 
20 25 30 

gtg cga gaa aaa agt gaa gtg aaa gec tac gtc gga gga tgt ccc teg 325 
Val Arg Glu Lys Ser Glu Val Lys Ala Tyr Val Gly Gly Cys Pro Ser 
35 40 45 

gta ate acg gat get gga gcg tat gac gcg etc ttc gac atg aga cgc 373 
Val He Thr Asp Ala Gly Ala Tyr Asp Ala Leu Phe Asp Met Arg Arg 
50 55 60 

C 9 C tgg tct aat aac ggt ggc ttc ccg ctg cga atg ctt gaa gag age 421 
Arg Trp Ser Asn Asn Gly Gly Phe Pro Leu Arg Met Leu Glu Glu Ser 
65 70 75 

tct tea gaa gtg aca teg tct teg gca ctg ggt ttg cca ccg gee atg 469 
Ser Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu Pro Pro Ala Met 
BO 85 90 95 

gtt atg teg ccg gaa tec ttg gcg teg ccc gag tat cga gee etc gag 517 
Val Met Ser Pro Glu Ser Leu Ala Ser Pro Glu Tyr Arg Ala Leu Glu 
100 105 110 

eta tgg age tac gat gac gga ate act tat aat aca gec cag tct ctg 565 
Leu Trp Ser Tyr Asp Asp Gly He Thr Tyr Asn Thr Ala Gin Ser Leu 
115 120 125 

ctg ggt gca tgc aat atg caa cag caa cag eta caa cct cag caa cca 613 
Leu Gly Ala Cys Asn Met Gin Gin Gin Gin Leu Gin Pro Gin Gin Pro 
130 135 140 

cat cca gca cca ccg acg etc ccc acg atg cct tta cca atg cct ccc 661 
His Pro Ala Pro Pro Thr Leu Pro Thr Met Pro Leu Pro Met Pro Pro 
145 150 155 

aca aca ccg aaa tea gaa aat gaa teg atg tea tea ggt cga gag gaa 709 
Thr Thr Pro Lys Ser Glu Asn Glu Ser Met Ser Ser Gly Arg Glu Glu 
160 165 170 175 

ctt teg ccg get tea age ata aat ggc tgc agt get gat get gac gec 757 
Leu Ser Pro Ala Ser Ser He Asn Gly Cys Ser Ala Asp Ala Asp Ala 
180 185 190 

aga egg cag aag aaa ggt cct gca cct cga cag caa gag gag eta tgt 805 
Arg Arg Gin Lys Lys Gly Pro Ala Pro Arg Gin Gin Glu Glu Leu Cys 
195 200 205 

ctt gtc tgc ggc gac aga gee tec gga tac cac tac aac gca ctg acg 853 
Leu Val Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr 
210 215 220 

tgt gaa gga tgc aaa gga ttc ttc agg egg agt gtc ace aaa aac gca 901 
Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Val Thr Lys Asn Ala 
225 230 235 

gta tat att tgt aaa ttt gga cat gee tgt gaa atg gat atg tac atg 949 
Val Tyr lie Cys Lys Phe Gly His Ala Cys Glu Met Asp Met Tyr Met 
240 245 250 255 
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agg agg aaa tgt caa gag tgt cga tta aag aaa tgt eta gcg gta gga 997 
Arg Arg Lys Cys Gin Glu Cys Arg Leu Lys Lys Cys Leu Ala Val Gly 
260 265 270 

atg agg cct gaa tgt gtc ata cag gag ccc agt aaa aat aaa gac agg 104 5 
Met Arg Pro Glu Cys Val lie Gin Glu Pro Ser Lys Asn Lys Asp Arg 
275 280 285 

caa aga caa aag aaa gac aaa gga ata tta tta cct gtt agt acg acc 1093 
Gin Arg Gin Lys Lys Asp Lys Gly lie Leu Leu Pro Val Ser Thr Thr 
290 295 300 

aca gtc gaa gac cac atg ccc ccg ate atg caa tgt gat cca cct ccg 1141 
Thr Val Glu Asp His Met Pro Pro lie Met Gin Cys Asp Pro Pro Pro 
305 310 315 

ccc gag gec gec agg att cac gaa gtc gtc ccg agg tat ctt teg gag 1189 
Pro Glu Ala Ala Arg lie His Glu Val Val Pro Arg Tyr Leu Ser Glu 
320 325 330 335 

aag ctg atg gag cag aac agg cag aag aac ata cca cca ttg teg gcg 123 7 
Lys Leu Met Glu Gin Asn Arg Gin Lys Asn He Pro Pro Leu Ser Ala 
340 345 350 

aat cag aag tct ctg ate gcg agg etc gtg tgg tac cag gag gga tat 1285 
Asn Gin Lys Ser Leu He Ala Arg Leu Val Trp Tyr Gin Glu Gly Tyr 
355 360 365 

gag cag ccc tec gac gag gat etc aaa aga gta acg cag act tgg cag 1333 
Glu Gin Pro Ser Asp Glu Asp Leu Lys Arg Val Thr Gin Thr Trp Gin 
370 375 380 

teg gat gaa gag gac gag gaa tec gat eta ccc ttc cgc cag ate acg 13 81 
Ser Asp Glu Glu Asp Glu Glu Ser Asp Leu Pro Phe Arg Gin He Thr 
385 390 395 

gag atg acg ate tta acg gtc cag ttg ate gtc gag ttc gec aag ggt 1429 
Glu Met Thr He Leu Thr Val Gin Leu He Val Glu Phe Ala Lys Gly 
400 405 410 415 

eta ccg ggc ttt teg aag ata tea cag tct gat caa ate acc tta tta 1477 
Leu Pro Gly Phe Ser Lys He Ser Gin Ser Asp Gin He Thr Leu Leu 
420 425 430 

aaa gee teg tec age gag gtg atg atg ctg egg gtg gcg agg cga tac 1525 
Lys Ala Ser Ser Ser Glu Val Met Met Leu Arg Val Ala Arg Arg Tyr 
435 440 445 

gac gee gcg tec gac age gtg ctg ttc gec aac aac aag gcg tac acg 1573 
Asp Ala Ala Ser Asp Ser Val Leu Phe Ala Asn Asn Lys Ala Tyr Thr 
450 455 460 

cgc gac aac tac cgc caa ggc ggc atg gee tac gtc ate gaa gac etc 1621 
Arg Asp Asn Tyr Arg Gin Gly Gly Met Ala Tyr Val He Glu Asp Leu 
465 470 475 

eta cac ttc tgc egg tgc atg ttc gcg atg ggc atg gac aat gtg cac 1669 
Leu His Phe Cys Arg Cys Met Phe Ala Met Gly Met Asp Asn Val His 
480 485 490 495 

ttt gca ctg etc acg gec ate gtt ata ttc tea gat egg ccc ggg etc 1717 
Phe Ala Leu Leu Thr Ala He Val He Phe Ser Asp Arg Pro Gly Leu 
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500 505 510 

gag cag ccg teg ctg gta gaa gag ate cag aga tac tac ctg aac acg 1765 
Glu Gin Pro Ser Leu Val Glu Glu He Gin Arg Tyr Tyr Leu Asn Thr 
515 520 525 

ttg cga att tac ate ate aac cag aac age gcg teg teg cgc tgc gec 1813 
Leu Arg He Tyr He He Asn Gin Asn Ser Ala Ser Ser Arg Cys Ala 
530 535 540 

gtg ate tac ggc agg ate ctg age gtg ctg ace gag eta cgc acg etc 1861 
Val He Tyr Gly Arg He Leu Ser Val Leu Thr Glu Leu Arg Thr Leu 
545 550 555 

ggc acg caa aac tec aac atg tgc ate teg ctg aag ctg aag aac agg 1909 
Gly Thr Gin Asn Ser Asn Met Cys He Ser Leu Lys Leu Lys Asn Arg 
560 565 570 575 

aag ctg ccg ccg ttc etc gag gag ate tgg gac gtg gcg gag gtg gee 1957 
Lys Leu Pro Pro Phe Leu Glu Glu He Trp Asp Val Ala Glu Val Ala 
580 585 590 

acg acg cat ccc acg gtg ctg ccg ccc ace aac ccg gtg gtg eta 2002 
Thr Thr His Pro Thr Val Leu Pro Pro Thr Asn Pro Val Val Leu 
595 600 605 

tagcctccgc ccgccccagg agagaacget catagactgg ctagttttag tgaacgtgcg 2062 

ctgatccgta ttcggtgaca gattagtgat tatatgtgtt gttgaacgtt tggagagtat 2122 

atatatagtg ttgaeggega ggcccgtccg gccccgtact tgtttcgttt ctgaceggat 2182 

getgegtegg tcgcgccctt gcgaccacga taagactact ttctataagt aegtctctaa 2242 

attgaggece caagaacegg aagectaact aagatacgac ggtttatttt atcacagagg 2302 

aaactgaagt aattaatata ttaacagtta acacaggttc aaagcagtta ggccgctccg 2362 

gacttctcat ggtctcaata aggeggaegt ataagagtta catagagaat aaaataataa 2422 

tatatgaaga gatgtttcta ttggatggaa tgcgtgatgt aaagttgata gttattttta 2482 

tttaccaaat taatgaagcg tcgggtgtag accttttgta tgtgatgtgg cgaggagtgg 2542 

atcgcagtgt cggccgcgtg ctctcaccaa aagcgtgcgg tegaegctaa tagtgcgatg 2602 

gttttggaat atgtttgttt atatatagtt tatgtgtgag gtgttatcgt gtcccgtcaa 2662 

ttttaatttc gattcgegtt cattcgtcct gtgetegcta ctcagattt 2711 

<210> 3 

<211> 606 

<212> PRT 

<213> Bombyx mori 

<400> 3 

Met Arg Val Glu Asn Val Asp Asn Val Ser Phe Ala Leu Asn Gly Arg 

15 10 15 

Ala Asp Glu Trp Cys Met Ser Val Glu Thr Arg Leu Asp Ser Leu Val 

20 25 30 

Arg Glu Lys Ser Glu Val Lys Ala Tyr Val Gly Gly Cys Pro Ser Val 

35 40 45 

He Thr Asp Ala Gly Ala Tyr Asp Ala Leu Phe Asp Met Arg Arg Arg 

50 55 60 

Trp Ser Asn Asn Gly Gly Phe Pro Leu Arg Met Leu Glu Glu Ser Ser 
65 70 75 80 

Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu Pro Pro Ala Met Val 

85 90 95 

Met Ser Pro Glu Ser Leu Ala Ser Pro Glu Tyr Arg Ala Leu Glu Leu 

100 105 110 

Trp Ser Tyr Asp Asp Gly He Thr Tyr Asn Thr Ala Gin Ser Leu Leu 

115 120 125 

Gly Ala Cys Asn Met Gin Gin Gin Gin Leu Gin Pro Gin Gin Pro His 
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130 










135 






140 






Pro 
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Pro 


Pro 
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Pro 
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Met 


Pro Leu Pro 


Met 


Pro Pro Thr 


145 










150 
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160 
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Pro 
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Ser 


Glu 
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Glu 


Ser 


Met 


Ser Ser Gly Arg 


Glu Glu Leu 










165 










170 




175 


Ser 


Pro 


Ala 


Ser 


Ser 


He 


Asn 


Gly 


Cys 


Ser Ala Asp Ala 


Asp Ala Arg 








180 










185 






190 


Arg 


Gin 


Lys 


Lys 


Gly 


Pro 


Ala 


Pro 


Arg Gin Gin Glu Glu 


Leu Cys Leu 






195 










200 






205 




Val 


Cys 


Gly 


Asp 


Arg 


Ala 


Ser 


Gly 


Tyr His Tyr Asn Ala 


Leu Thr Cys 




210 










215 






220 






Glu 


Gly 


Cys 


Lys 


Gly 


Phe 


Phe 


Arg 


Arg 


Ser Val Thr 


Lys 


Asn Ala Val 


225 










230 








235 




240 


Tyr 


He 


Cys 


Lys 


Phe 


Gly 


His 


Ala 


Cys 


Glu Met Asp 


Met 


Tyr Met Arg 










245 










250 




255 


Arg 


Lys 


Cys 


Gin 


Glu 


Cys 


Arg 


Leu 


Lys 


Lys Cys Leu 


Ala 


Val Gly Met 








260 










265 






270 


Arg 


Pro 


Glu 


Cys 


Val 


He 


Gin 


Glu 


Pro 


Ser Lys Asn 


Lys 


Asp Arg Gin 






275 










280 






285 




Arg 


Gin 


Lys 


Lys 


Asp 


Lys 


Gly 


He 


Leu 


Leu Pro Val 


Ser 


Thr Thr Thr 




290 










295 






300 






Val 


Glu 


Asp 


His 


Met 


Pro 


Pro 


He 


Met 


Gin Cys Asp 


Pro 


Pro Pro Pro 


305 










310 








315 




320 


Glu 


Ala 


Ala 


Arg 


He 


His 


Glu 


Val 


Val 


Pro Arg Tyr 


Leu 


Ser Glu Lys 










325 










330 




335 


Leu 


Met 


Glu 


Gin 


Asn 


Arg 


Gin 


Lys 


Asn 


He Pro Pro 


Leu 


Ser Ala Asn 








340 










345 






350 


Gin 


Lys 


Ser 


Leu 


He 


Ala 


Arg 


Leu 


Val 


Trp Tyr Gin 


Glu 


Gly Tyr Glu 






355 










360 






365 




Gin 


Pro 


Ser 


Asp 


Glu 


Asp 


Leu 


Lys 


Arg Val Thr Gin 


Thr 


Trp Gin Ser 




370 










375 






380 






Asp 


Glu 


Glu 


Asp 


Glu 


Glu 


Ser 


Asp 


Leu 


Pro Phe Arg Gin 


He Thr Glu 


385 










390 








395 




400 


Met 


Thr 


He 


Leu 


Thr 


Val 


Gin 


Leu 


lie 


Val Glu Phe 


Ala 


Lys Gly Leu 










405 










410 




415 


Pro 


Gly 


Phe 


Ser 


Lys 


He 


Ser 


Gin 


Ser Asp Gin He 


Thr 


Leu Leu Lys 








420 










425 






430 


Ala 


Ser 


Ser 


Ser 


Glu 


Val 


Met 


Met 


Leu Arg Val Ala Arg 


Arg Tyr Asp 






435 










440 






445 




Ala 


Ala 


Ser 


Asp 


Ser 


Val 


Leu 


Phe 


Ala 


Asn Asn Lys 


Ala 


Tyr Thr Arg 




450 










455 






460 






Asp 


Asn 


Tyr 


Arg 


Gin 


Gly 


Gly 


Met 


Ala Tyr Val He 


Glu 


Asp Leu Leu 


465 










.470 








475 




480 


His 


Phe 


Cys 


Arg 


Cys 


Met 


Phe 


Ala 


Met Gly Met Asp Asn 


Val His Phe 










485 










490 




495 


Ala 


Leu 


Leu 


Thr 


Ala 


He 


Val 


He 


Phe 


Ser Asp Arg 


Pro 


Gly Leu Glu 








500 










505 






510 


Gin 


Pro 


Ser 


Leu 


Val 


Glu 


Glu 


He 


Gin Arg Tyr Tyr 


Leu 


Asn Thr Leu 






515 










520 






525 




Arg 


He 


Tyr 


He 


He 


Asn 


Gin 


Asn 


Ser 


Ala Ser Ser Arg 


Cys Ala Val 




530 










535 






540 






He 


Tyr 


Gly 


Arg 


He 


Leu 


Ser 


Val 


Leu Thr Glu Leu Arg 


Thr Leu Gly 


545 










550 








555 




560 


Thr 


Gin 


Asn 


Ser 


Asn 


Met 


Cys 


He 


Ser 


Leu Lys Leu Lys 


Asn Arg Lys 










565 










570 




575 


Leu 


Pro 


Pro 


Phe 


Leu 


Glu 


Glu 


He 


Trp Asp Val Ala Glu 


Val Ala Thr 








580 










585 






590 


Thr 


His 


Pro 


Thr 


Val 


Leu 


Pro 


Pro 


Thr 


Asn Pro Val 


Val 


Leu 



595 600 605 



<210> 4 
<211> 17 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> GAL4 response element 



<400> 4 
cggaggactg tcctccg 

<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



gagagaagct tatggcgcgc ccgaccgatg 

<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<400> 6 

cacacacgcg tgtactcgtc aattccaag 

<210> 7 
<211> 1977 
<212> DNA 

<213> Artificial Sequence 



<400> 7 

atggcgcgcc cgaccgatgt cagcctgggg gacgagctcc acttagacgg cgaggacgtg 
gcgatggcgc atgccgacgc gctagacgat ttcgatctgg acatgttggg ggacggggat 
tccccgggtc cgggatttac cccccacgac tccgccccct acggcgctct ggatatggcc 
gacttcgagt ttgagcagat gtttaccgat gcccttggaa ttgacgagta cacgcgttta 
gatagtttag tgcgagaaaa aagtgaagtg aaagcctacg tcggaggatg tccctcggta 
atcacggatg ctggagcgta tgacgcgctc ttcgacatga gacgccgctg gtctaataac 
ggtggcttcc cgctgcgaat gcttgaagag agctcttcag aagtgacatc gtcttcggca 
ctgggtttgc caccggccat ggttatgtcg ccggaatcct tggcgtcgcc cgagtatcga 
gccctcgagc tatggagcta cgatgacgga atcacttata atacagccca gtctctgctg 
ggtgcatgca atatgcaaca gcaacagcta caacctcagc aaccacatcc agcaccaccg 
acgctcccca cgatgccttt accaatgcct cccacaacac cgaaatcaga aaatgaatcg 
atgtcatcag gtcgagagga actttcgccg gcttcaagca taaatggctg cagtgctgat 
gctgacgcca gacggcagaa gaaaggtcct gcacctcgac agcaagagga gctatgtctt 
gtctgcggcg acagagcctc cggataccac tacaacgcac tgacgtgtga aggatgcaaa 
ggattcttca ggcggagtgt caccaaaaac gcagtatata tttgtaaatt tggacatgcc 
tgtgaaatgg atatgtacat gaggaggaaa tgtcaagagt gtcgattaaa gaaatgtcta 
gcggtaggaa tgaggcctga atgtgtcata caggagccca gtaaaaataa agacaggcaa 
agacaaaaga aagacaaagg aatattatta cctgttagta cgaccacagt cgaagaccac 
atgcccccga tcatgcaatg tgatccacct ccgcccgagg ccgccaggat tcacgaagtc 
gtcccgaggt atctttcgga gaagctgatg gagcagaaca ggcagaagaa cataccacca 
tt gtcggcga atcagaagtc tctgatcgcg aggctcgtgt ggtaccagga gggatatgag 
cagccctccg acgaggatct caaaagagta acgcagactt ggcagtcgga tgaagaggac 
gaggaatccg atctaccctt ccgccagatc acggagatga cgatcttaac ggtccagttg 
atcgtcgagt tcgccaaggg tctaccgggc ttttcgaaga tatcacagtc tgatcaaatc 
accttattaa aagcctcgtc cagcgaggtg atgatgctgc gggtggcgag gcgatacgac 
gccgcgtccg acagcgtgct gttcgccaac aacaaggcgt acacgcgcga caactaccgc 
caaggcggca tggcctacgt catcgaagac ctcctacact tctgccggtg catgttcgcg 
atgggcatgg acaatgtgca ctttgcactg ctcacggcca tcgttatatt ctcagatcgg 
cccgggctcg agcagccgtc gctggtagaa gagatccaga gatactacct gaacacgttg 
cgaatttaca tcatcaacca gaacagcgcg tcgtcgcgct gcgccgtgat ctacggcagg 
atcctgagcg tgctgaccga gctacgcacg ctcggcacgc aaaactccaa catgtgcatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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tcgctgaagc tgaagaacag gaagctgccg ccgttcctcg aggagatctg ggacgtggcg 1920 

gaggtggcca cgacgcatcc cacggtgctg ccgcccacca acccggtggt gctatag 1977 

<210> 8 
<211> 9072 
<212> DNA 

<213> Artificial Sequence 
<40O> 8 

tatggaggga tacccgtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 60 

ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 120 

aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 180 

tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 240 

atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 300 

agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 360 

tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 420 

tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 4 80 

atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 540 

ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 600 

tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 660 

acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 720 

ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 780 

aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 840 

ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc 900 

cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata 960 

gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 1020 

actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 1080 

agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 1140 

cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 1200 

tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 1260 

agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 1320 

tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 1380 

acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 1440 

ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 1500 

gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 1560 

gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 1620 

gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 1680 

tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 1740 

caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 1800 

tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc 1860 

gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg 1920 

agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt 1980 

ggccgattca ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc 2040 

gcaacgcaat taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc 2100 

ttccggctcg tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct 2160 

atgaccatga ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc 2220 

tagcggccgc aataaaataa aagattttat ttagtctcca gaaaaagggg ggaatgaaag 2280 

accccacctg taggtttggc aagctagctt aagtaacgcc attttgcaag gcatggaaaa 2340 

atacataact gagaatagag aagttcagat caaggtcagg aacagatgga acagctgaat 2400 

atgggccaaa caggatatct gtggtaagca gttcctgccc cggctcaggg ccaagaacag 2460 

atggaacagc tgaatatggg ccaaacagga tatctgtggt aagcagttcc tgccccggct 2520 

cagggccaag aacagatggt ccccagatgc ggtccagccc tcagcagttt ctagaacaag 2580 

ggttcaatgc acttgtcgct agaacaaggg ttcaatgcac ttgtcgctag aacaagggtt 2640 

caatgcactt gtcgctagaa caagggttca atgcacttgt cgagaaccat cagatgtttc 2700 

cagggtgccc caaggacctg aaatgaccct gtgccttatt tgaactaacc aatcagttcg 2760 

cttctcgctt ctgttcgcgc gcttctgctc cccgagctca ataaaagagc ccacaacccc 2820 

tcactcgggg cgccagtcct ccgattgact gagacgcgtc ggggtacccg tattcccaat 2880 

aaagcctctt gctgttgcat ccgaatcgtg gactcgctga tccttgggag ggtctcctca 2940 

gattgattga ctgcccacct cgggggtctt tcatttggag gttccaccga gattggagac 3 000 

ccctgcctag ggaccaccga ccccccccgc cgggaggtaa gctggccagc ggtcggtttc 3 060 

gtgtctgtct ctgtctttgt gcgtgtttgt gccggcatct aatgtttgcg cctgcgtctg 3120 

tactagttag ctaactagct ctgtatctgg cggacccgtg gtggaactga cgagttctga 3180 
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acacccggcc gcaaccctgg gagacgtccc agggactttg ggggccgttt ttgtggcccg 3240 

acctgaggaa gggagtcgat gtggaatccg accccgtcag gatatgtggt tctggtagga 3300 

gacgagaacc taaaacagtt cccgcctccg tctgaatttt tgctttcggt ttggaaccga 3360 

agccgcgcgt cttgtctgct gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct 3420 

gtatttgtct gaaaattagg gccagactgt taccactccc ttaagtttga ccttaggtca 3480 

ctggaaagat gtcgagcgga tcgctcacaa ccagtcggta gatgtcaaga agagacgttg 3540 

ggttaccttc tgctctgcag aatggccaac ctttaacgtc ggatggccgc gagacggcac 3600 

ctttaaccga gacctcatca cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg 3660 

acacccagac caggtcccct acatcgtgac ctgggaagcc ttggcttttg acccccctcc 3720 

ctgggtcaag ccctttgtac accctaagcc tccgcctcct cttcctccat ccgccccgtc 3780 

tctccccctt gaacctcctc gttcgacccc gcctcgatcc tccctttatc cagccctcac 3840 

tccttctcta ggcgccggaa ttcggcgcgt gtggcctcga acaccgagcg accctgcagc 3 900 

gacccgctta acagcgtaag cttgtcgacg gtatcgataa gcttatggcg cgcccgaccg 3960 

atgtcagcct gggggacgag ctccacttag acggcgagga cgtggcgatg gcgcatgccg 4020 

acgcgctaga cgatttcgat ctggacatgt tgggggacgg ggattccccg ggtccgggat 4080 

ttacccccca cgactccgcc ccctacggcg ctctggatat ggccgacttc gagtttgagc 4140 

agatgtttac cgatgccctt ggaattgacg agtacacgcg tttagatagt ttagtgcgag 4200 

aaaaaagtga agtgaaagcc tacgtcggag gatgtccctc ggtaatcacg gatgctggag 4260 

cgtatgacgc gctcttcgac atgagacgcc gctggtctaa taacggtggc ttcccgctgc 4320 

gaatgcttga agagagctct tcagaagtga catcgtcttc ggcactgggt ttgccaccgg 43 80 

ccatggttat gtcgccggaa tccttggcgt cgcccgagta tcgagccctc gagctatgga 4440 

gctacgatga cggaatcact tataatacag cccagtctct gctgggtgca tgcaatatgc 4 500 

aacagcaaca gctacaacct cagcaaccac atccagcacc accgacgctc cccacgatgc 4 560 

ctttaccaat gcctcccaca acaccgaaat cagaaaatga atcgatgtca tcaggtcgag 4620 

aggaactttc gccggcttca agcataaatg gctgcagtgc tgatgctgac gccagacggc 4680 

agaagaaagg tcctgcacct cgacagcaag aggagctatg tcttgtctgc ggcgacagag 4740 

cctccggata ccactacaac gcactgacgt gtgaaggatg caaaggattc ttcaggcgga 4 800 

gtgtcaccaa aaacgcagta tatatttgta aatttggaca tgcctgtgaa atggatatgt 4860 

acatgaggag gaaatgtcaa gagtgtcgat taaagaaatg tctagcggta ggaatgaggc 4920 

ctgaatgtgt catacaggag cccagtaaaa ataaagacag gcaaagacaa aagaaagaca 4980 

aaggaatatt attacctgtt agtacgacca cagtcgaaga ccacatgccc ccgatcatgc 5040 

aatgtgatcc acctccgccc gaggccgcca ggattcacga agtcgtcccg aggtatcttt 5100 

cggagaagct gatggagcag aacaggcaga agaacatacc accattgtcg gcgaatcaga 5160 

agtctctgat cgcgaggctc gtgtggtacc aggagggata tgagcagccc tccgacgagg 5220 

atctcaaaag agtaacgcag acttggcagt cggatgaaga ggacgaggaa tccgatctac 5280 

ccttccgcca gatcacggag atgacgatct taacggtcca gttgatcgtc gagttcgcca 5340 

agggtctacc gggcttttcg aagatatcac agtctgatca aatcacctta ttaaaagcct 5400 

cgtccagcga ggtgatgatg ctgcgggtgg cgaggcgata cgacgccgcg tccgacagcg 5460 

tgctgttcgc caacaacaag gcgtacacgc gcgacaacta ccgccaaggc ggcatggcct 5520 

acgtcatcga agacctccta cacttctgcc ggtgcatgtt cgcgatgggc atggacaatg 5580 

tgcactttgc actgctcacg gccatcgtta tattctcaga tcggcccggg ctcgagcagc 5640 

cgtcgctggt agaagagatc cagagatact acctgaacac gttgcgaatt tacatcatca 5700 

accagaacag cgcgtcgtcg cgctgcgccg tgatctacgg caggatcctg agcgtgctga 5760 

ccgagctacg cacgctcggc acgcaaaact ccaacatgtg catctcgctg aagctgaaga 5820 

acaggaagct gccgccgttc ctcgaggaga tctgggacgt ggcggaggtg gccacgacgc 5880 

atcccacggt gctgccgccc accaacccgg tggtgctata gcctccgccc gccccaggag 5940 

agaacgctca tagactggct agttttagtg aacgtgcgct gatccgtatt cggtgacaga 6000 

ttagtgatta tatgtgttgt tgaacgtttg gagagtatat atatagtgtt atcgaattcc 6060 

gcccctctcc ctcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 6120 

gtgcgtttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 6180 

ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 6240 

gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 6300 

aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 6360 

tctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 6420 

acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 6480 

aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 6540 

gcacatgctt tacgtgtgtt tagtcgaggt taaaaaacgt ctaggccccc cgaaccacgg 6600 

ggacgtggtt ttcctttgaa aaacacgatg ataatatggc cacaaccatg gcatgattga 6660 

acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat tcggctatga 6720 

ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt cagcgcaggg 6780 

gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac tgcaggacga 684 0 

ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg tgctcgacgt 6900 
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tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc aggatctcct 6960 

gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa tgcggcggct 7020 

gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc gcatcgagcg 7080 

agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg aagagcatca 714 0 

ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg acggcgagga 7200 

tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa atggccgctt 7260 

ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg acatagcgtt 7320 

ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct tcctcgtgct 73 80 

ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc ttgacgagtt 7440 

cttctgagcg ggactctggg gttcgaacct gtaggtttgg caagctagaa caagggttca 7500 

atgcacttgt cgctagaaca agggttcaat gcacttgtcg ctagaacaag ggttcaatgc 7560 

acttgtcgct agaacaaggg ttcaatgcac ttgtcgagaa ccatgggtcg agtaggcgtg 7620 

tacggtggga ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagac 7680 

gccatccaac gctgttttgc ctccaaaaga caccgggacc gaggccgcct gggcccgttt 7740 

aaacatcgat aaaataaaag attttattta gtctccagaa aaagggggga atgaaagacc 7800 

ccacctgtag gtttggcaag ctagcttaag taacgccatt ttgcaaggca tggaaaaata 7860 

cataactgag aatagagaag ttcagatcaa ggtcaggaac agatggaaca gctgaatatg 7920 

ggccaaacag gatatctgtg gtaagcagtt cctgccccgg ctcagggcca agaacagatg 7980 

gaacagctga atatgggcca aacaggatat ctgtggtaag cagttcctgc cccggctcag 8040 

ggccaagaac agatggtccc cagatgcggt ccagccctca gcagtttcta gaacaagggt 8100 

tcaatgcact tgtcgctaga acaagggttc aatgcacttg tcgctagaac aagggttcaa 8160 

tgcacttgtc gctagaacaa gggttcaatg cacttgtcga gaaccatcag atgtttccag 8220 

ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga actaaccaat cagttcgctt 8280 

ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata aaagagccca caacccctca 8340 

ctcggggcgc cagtcctccg attgactgag tcgcccgggt acccgtattc ccaataaacc 8400 

tcttgctgat gcatggtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg 8460 

gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 8520 

gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 8580 

tcccaacagt tgcgcagcct gaatggcgaa tggaaattgt aagcgttaat attttgttaa 8640 

aattcgcgtt aaatttttgt taaatcagct cattttttaa ccaataggcc gaaatcggca 8700 

aaatccctta taaatcaaaa gaatagaccg agatagggtt gagtgttgtt ccagtttgga 8760 

acaagagtcc actattaaag aacgtggact ccaacgtcaa agggcgaaaa accgtctatc 8820 

agggcgatgg cccactacgt gaaccatcac cctaatcaag ttttttgggg tcgaggtgcc 8880 

gtaaagcact aaatcggaac cctaaaggga gcccccgatt tagagcttga cggggaaagc 8940 

cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg agcgggcgct agggcgctgg 9000 
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